NUCLEIC ACID SEQUENCES FOR ATP-BINDING CASSETTE TRANSPORTER

BACKGROUND OF THE INVENTION 

Macular degeneration affects approximately 1.7 million individuals in the U.S. and is
the most common cause of acquired visual impairment in those over the age of 65.
Stargardt disease (STGD; McKusick Mendelian Inheritance in Man (MIM) #248200) is arguably
the most common hereditary recessive macular dystrophy and is characterized by juvenile to
young adult onset, central visual impairment, progressive bilateral atrophy of the macular
retinal pigment epithelium (RPE) and neuroepithelium, and the frequent appearance of
orange-yellow flecks distributed around the macula and/or the midretinal periphery
(Stargardt, 1909; Anderson et al., 1995). A clinically similar retinal disorder (Fundus
Flavimaculatus, FFM; Franceschetti, 1963) often displays later age of onset and slower
progression (Fishman, 1976; Noble and Carr, 1979). From linkage analysis, it has been
concluded that STGD and FFM are most likely allelic autosomal recessive disorders with
slightly different clinical manifestations caused by mutation(s) of a gene at chromosome
1p13-p21 (Gerber et al., 1995; Anderson et al., 1995). The STGD gene has been localized to
a 4 cM region flanked by the recombinant markers D1S435 and D1S236, and a complete yeast
artificial chromosome (YAC) contig of the region has been constructed (Anderson et al.,
1995). Recently, the location of the STGD/FFM locus on human chromosome 1p has been
refined to a 2 cM interval between polymorphic markers D1S406 and D1S236 by genetic
linkage analysis in an independent set of STGD families (Hoyng et al., 1996). Autosomal
dominant disorders with somewhat similar clinical phenotypes to STGD, identified in single
large North American pedigrees, have been mapped to chromosome 13q34 (STGD2; MIM #153900;
Zhang et al., 1994) and to chromosome 6q11-q14 (STGD3; MIM #600110; Stone et al., 1994),
although these conditions are not characterized by the pathognomonic dark choroid observed
by fluorescein angiography (Gass, 1987).

Members of the superfamily of mammalian ATP binding cassette (ABC) transporters are being
considered as possible candidates for human disease phenotypes. The ABC superfamily
includes genes whose products are transmembrane proteins involved in energy-dependent
transport of a wide spectrum of substrates across membranes (Childs and Ling, 1994; Dean
and Allikmets, 1995). Mutations in many disease-causing members of this superfamily result
in defects in the transport of specific substrates (CFTR, Riordan et al., 1989; ALD,
Mosser et al., 1993; SUR, Thomas et al., 1995; PMP70, Shimozawa et al., 1992; TAP2, de la
Salle et al., 1994). In eukaryotes, ABC genes typically encode four domains that include
two conserved ATP-binding domains (ATP) and two domains with multiple transmembrane (TM)
segments (Hyde et al., 1990). The ATP-binding domains of ABC genes contain motifs of
characteristic conserved residues (Walker A and B motifs) spaced by 90-120 amino acids.
Both this conserved spacing and the "Signature" or "C" motif just upstream of the Walker B
site distinguish members of the ABC superfamily from other ATP-binding proteins (Hyde et
al., 1990; Michaelis and Berkower, 1995). These features have allowed the isolation of new
ABC genes by hybridization, degenerate PCR, and inspection of DNA sequence databases
(Allikmets et al., 1993, 1995; Dean et al., 1994; Luciani et al., 1994).
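
As an illustration of how the conserved spacing might be applied when inspecting sequence
databases, the following Python sketch looks for a Walker A match followed by a Walker B
match 90 to 120 residues later. The two regular expressions are commonly cited consensus
patterns (Walker A as GxxxxGK[ST]; Walker B as four hydrophobic residues followed by D)
and are assumptions made for illustration, not patterns taken from this disclosure.
Measuring the spacing from motif start to motif start is likewise an assumption, and the
function name and toy sequence are hypothetical.

```python
import re

# Assumed consensus patterns (illustrative only, not from the disclosure).
WALKER_A = re.compile(r"G.{4}GK[ST]")
WALKER_B = re.compile(r"[AVILMFYW]{4}D")


def find_abc_like_sites(protein, min_gap=90, max_gap=120):
    """Return (a_start, b_start) pairs where a Walker B match begins
    min_gap..max_gap residues after the start of a Walker A match."""
    hits = []
    for a in WALKER_A.finditer(protein):
        for b in WALKER_B.finditer(protein):
            gap = b.start() - a.start()
            if min_gap <= gap <= max_gap:
                hits.append((a.start(), b.start()))
    return hits


# Hypothetical toy sequence: Walker A at index 0, Walker B 100 residues later.
toy = "GPNGAGKT" + "S" * 92 + "LLILDE"
print(find_abc_like_sites(toy))
```

A match of this kind would only flag a candidate; the Signature (C) motif and the overall
domain organization would still need to be examined before assigning a sequence to the
ABC superfamily.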

The characterization of twenty-one new members of the ABC superfamily may permit functions
to be assigned to these genes by determining their map locations and their patterns of
expression (Allikmets et al., 1996). That many known ABC genes are involved in inherited
human diseases suggests that some of these new loci will also encode proteins mutated in
specific genetic disorders. Even when a gene has been regionally localized by mapping,
determining its precise location and sequence nonetheless requires selecting that gene
from among about 250 genes, spanning about four to five million base pairs, within the
regionally localized chromosomal site.

While advancements have been made as described above, mutations in retina-specific ABC
transporter (ABCR) in patients with recessive macular dystrophy STGD/FFM have not yet been
identified to Applicant's knowledge. That ABCR expression is limited to photoreceptors, as
determined by the present invention, provides evidence as to why ABCR has not yet been
sequenced. Further, the ABC1 subfamily of ABC transporters is not represented by any
homolog in yeast (Michaelis and Berkower, 1995), suggesting that these genes evolved to
perform specialized functions in multicellular organisms, which also helps explain why the
ABCR gene has been difficult to identify. In contrast to ABC genes in bacteria, the
homologous genes in higher eukaryotes are much less well studied. The fact that
prokaryotes contain a large number of ABC genes suggests that many mammalian members of
the superfamily remain uncharacterized. The task of studying eukaryotic ABC genes is more
difficult because of the significantly higher complexity of eukaryotic systems and the
apparent difference in function of even highly homologous genes. While ABC proteins are
the principal transporters of a number of diverse compounds in bacterial cells, eukaryotes
have evolved other mechanisms for the transport of many amino acids and sugars. Eukaryotes
have other reasons to diversify the role of ABC genes, for example, performing such
functions as ion transport, toxin elimination, and secretion of signaling molecules.

Accordingly, there remains a need for the identification of the sequence of the gene that,
in mutated forms, is associated with retinal and/or macular degenerative diseases,
including, for example, Stargardt Disease and Fundus Flavimaculatus, in order to provide
enhanced diagnoses and improved prognoses and interventional therapies for individuals
affected with such diseases.

SUMMARY OF THE INVENTION 

The present invention provides sequences encoding an ATP binding cassette transporter.
Nucleic acid sequences, including SEQ ID NO: 1, which is a genomic sequence, and SEQ ID
NOS: 2 and 5, which are cDNA sequences, are sequences to which the present invention is
directed.

A further aspect of the present invention provides ATP binding cassette transporter
polypeptides and/or proteins. SEQ ID NOS: 3 and 6 are novel polypeptides of the invention
produced from nucleotide sequences encoding the ATP binding cassette transporter. Also
within the scope of the present invention is a purified ATP binding cassette transporter.

The present invention also provides an expression vector comprising a nucleic acid
sequence encoding an ATP binding cassette transporter, a transformed host cell capable of
expressing a nucleic acid sequence encoding an ATP binding cassette transporter, a cell
culture capable of expressing an ATP binding cassette transporter, and a protein
preparation comprising an ATP binding cassette transporter.

The present invention is also directed to a method of screening for an agent that modifies
ATP binding cassette transporter comprising combining purified ATP binding cassette
transporter with an agent suspected of modifying ATP binding cassette transporter and
observing a change in at least one characteristic associated with ATP binding cassette
transporter. The present invention provides methods of identifying an agent that inhibits
macular degeneration comprising combining purified ATP binding cassette transporter from a
patient suspected of having macular degeneration and an agent suspected of interacting
with the ATP binding cassette transporter and observing an inhibition in at least one of
the characteristics of diseases associated with the ATP binding cassette transporter. In
addition, the present invention provides for methods of identifying an agent that induces
onset of at least one characteristic associated with ATP binding cassette transporter
comprising combining purified wild-type ATP binding cassette transporter with an agent
suspected of inducing a macular degenerative disease and observing the onset of a
characteristic associated with macular degeneration.

BRIEF DESCRIPTION OF THE FIGURES 

Figures 1A and 1B display the ABCR gene and amplification products. Figure 1A displays a
physical map of the ABCR gene. Mega-YAC clones from the CEPH mega-YAC genomic library
(Bellane-Chantelot et al., 1992) encompassing the 4 cM critical region for STGD are
represented by horizontal bars with shaded circles indicating confirmed positives for STSs
by landmark mapping. The individual STS markers and their physical order are shown below
the YACs with arrows indicating the centromeric (cen) and telomeric (1pter) direction
(Anderson et al., 1995). The horizontal double-headed arrow labeled STGD indicates the
refined genetic interval delineated by historical recombinants (Anderson et al., 1995).
Figure 1B displays the results of agarose gel electrophoresis of PCR amplification
products with primers from the 5' (GGTCTTCGTGTGTGGTCATT, SEQ ID NO: 114,
GGTCCAGTTCTTCCAGAG, SEQ ID NO: 115, labeled 5' ABCR) or 3' (ATCCTCTGACTCAGCAATCACA, SEQ
ID NO: 116, TTGCAATTACAAATGCAATGG, SEQ ID NO: 117, labeled 3' ABCR) regions of ABCR on
the 13 different YAC DNA templates indicated as diagonals above the gel. The asterisk
denotes that YAC 680_b_5 was positive for the 5' ABCR PCR but negative for the 3' ABCR
PCR. These data suggest the ABCR gene maps within the interval delineated by markers
D1S3361-D1S236 and is transcribed toward the telomere, as depicted by the open horizontal
box.

Figure 2 exhibits the size and tissue distribution of ABCR transcripts in the adult rat. A
blot of total RNA from the indicated tissues was hybridized with a 1.6 kb mouse Abcr probe
(top) and a ribosomal protein S26 probe (bottom; Kuwano et al., 1985). The ABCR probe
revealed a predominant transcript of approximately 8 kb that is found in retina only. The
mobilities of the 28S and 18S ribosomal RNAs are indicated at the right. B, brain; H,
heart; K, kidney; Li, liver; Lu, lung; R, retina; S, spleen.

Figure 3 shows the sequence of the ABCR coding region within the genomic ABCR sequence,
SEQ ID NO: 1. The sequence of the ABCR cDNA, SEQ ID NO: 2, is shown with the predicted
protein sequence, SEQ ID NO: 3, in one-letter amino acid code below. The location of
splice sites is shown by the symbol |.

Figure 4 displays the alignment of the ABCR protein, SEQ ID NO: 3, with other members of
the ABC1 subfamily. The deduced amino acid sequence of ABCR is shown aligned to known
human and mouse proteins that are members of the same subfamily. Abc1, mouse Abc1; Abc2,
mouse Abc2; and ABCC, human ABC gene. The Walker A and B motifs and the Signature motif C
are designated by underlining and the letters A, B, and C, respectively.

Figure 5 exhibits the location of Abcr from a Jackson BSS Backcross showing a portion of
mouse chromosome 3. The map is depicted with the centromere toward the top. A 3 cM scale
bar is also shown. Loci mapping to the same position are listed in alphabetical order.

Figure 6 shows the segregation of SSCP variants in exon 49 of the ABCR gene in kindred
AR293. Sequence analysis of SSCP bands revealed the existence of wild-type sequence (bands
1 and 3) and mutant sequence (bands 2 and 4). DNA sequencing revealed a 15 base pair
deletion, while the affected children (lanes 2 and 3) are homozygous. Haplotype analysis
demonstrated homozygosity at the STGD locus in the two affected individuals.

Figures 7A-H show the localization of ABCR transcripts to photoreceptor cells. In situ
hybridization was performed with digoxygenin-labeled riboprobes and visualized using an
alkaline phosphatase conjugated anti-digoxygenin antibody. Figures 7A-D display
hybridization results of retina and choroid from a pigmented mouse (C57/B16); Figures 7E
and 7F show hybridization results of retina and choroid from an albino rat; and Figures 7G
and 7H exhibit hybridization results of retina from a macaque monkey. Figures 7A, 7E, and
7G display results from a mouse abcr antisense probe; Figure 7B exhibits results from a
mouse abcr sense probe; Figure 7C shows results from a macaque rhodopsin antisense probe;
and Figures 7D, 7F, and 7H display results from a mouse blue cone pigment antisense probe.
ABCR transcripts are localized to the inner segments of the photoreceptor cell layer, a
pattern that matches the distribution of rhodopsin transcripts but is distinct from the
distribution of cone visual pigment transcripts. Hybridization is not observed in the RPE
or choroid, as seen most clearly in the albino rat eye (arrowhead in Figure 7E). The
retinal layers indicated in Figure 7B are: OS, outer segments; IS, inner segments; ONL,
outer nuclear layer; OPL, outer plexiform layer; INL, inner nuclear layer; IPL, inner
plexiform layer; GCL, ganglion cell layer.

Figure 8 provides a pGEM®-T Vector map.

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed to the nucleic acid and protein sequences encoding ATP
binding cassette transporter. The ATP binding cassette transporter of the present
invention is retina-specific ATP binding cassette transporter (ABCR); more particularly,
ABCR may be isolated from retinal cells, preferably photoreceptor cells. The present
invention provides nucleotide sequences of ABCR including genomic sequences, SEQ ID NO: 1,
and cDNA sequences, SEQ ID NOS: 2 and 5. Novel polypeptide sequences, SEQ ID NOS: 3 and 6,
for ABCR, are the translated products of SEQ ID NOS: 2 and 5, respectively, and are also
included in the present invention.

SEQ ID NO: 1 provides the human genomic DNA sequence of ABCR. SEQ ID NOS: 2 and 5 provide
wild-type cDNA sequences of human ABCR, which result in translated products SEQ ID NOS: 3
and 6, respectively. While not intending to be bound by any particular theory or theories
of operation, it is believed that SEQ ID NOS: 2 and 5 are isoforms of ABCR cDNA. The
difference between SEQ ID NOS: 2 and 5 may be accounted for by an additional sequence in
SEQ ID NO: 2 which is added between bases 4352 and 4353 of SEQ ID NO: 5. This difference
is thought to arise from alternative splicing of the nascent transcript of ABCR, in which
an alternative exon 30, SEQ ID NO: 4, is excluded. This alternative exon encodes an
additional 38 amino acids, SEQ ID NO: 11.
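
The relationship between the two cDNA isoforms described above can be pictured as a simple
insertion at a fixed coordinate. The following Python sketch uses placeholder strings
rather than the actual sequences, and it assumes that the alternative exon is an in-frame
block of 38 codons (114 nucleotides); the function name `insert_exon` is hypothetical.

```python
def insert_exon(short_cdna: str, exon: str, after_base: int = 4352) -> str:
    """Insert `exon` between 1-based positions after_base and after_base + 1."""
    if not 0 <= after_base <= len(short_cdna):
        raise ValueError("insertion point lies outside the sequence")
    return short_cdna[:after_base] + exon + short_cdna[after_base:]


# Hypothetical placeholders, not the real sequences.
short_form = "A" * 4352 + "C" * 10   # stands in for the shorter isoform
alt_exon = "G" * (38 * 3)            # assumed 38 codons, i.e. 114 nucleotides
long_form = insert_exon(short_form, alt_exon)

print(len(long_form) - len(short_form))
```

Because the insertion point is given in 1-based base numbering, slicing at index 4352 in
Python places the exon immediately after base 4352 and before base 4353.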

Nucleic acids within the scope of the present invention include cDNA, RNA, genomic DNA,
fragments or portions within the sequences, and antisense oligonucleotides. Sequences
encoding the ABCR also include amino acid, polypeptide, and protein sequences. Variations
in the nucleic acid and polypeptide sequences of the present invention are within the
scope of the present invention and include N terminal and C terminal extensions,
transcription and translation modifications, and modifications in the cDNA sequence to
facilitate and improve transcription and translation efficiency. In addition, changes
within the wild-type sequences identified herein, where the changed sequence retains
substantially the same wild-type activity, such that the changed sequences are
substantially similar to the ABCR sequences identified, are also considered within the
scope of the present invention. Mismatches, insertions, and deletions which permit
substantial similarity to the ABCR sequences, such as similarity in residues in
hydrophobicity, hydrophilicity, basicity, and acidity, will be known to those of skill in
the art once armed with the present disclosure. In addition, the isolated, or purified,
sequences of the present invention may be natural, recombinant, synthetic, or a
combination thereof. Wild-type activity associated with the ABCR sequences of the present
invention includes, inter alia, all or part of a sequence, or a sequence substantially
similar thereto, that codes for ATP binding cassette transporter.

The genomic, SEQ ID NO: 1, and cDNA, SEQ ID NOS: 2 and 5, sequences are identified in
Figure 3 and encode ABCR, certain mutations of which are responsible for the class of
retinal disorders known as retinal or macular degenerations. Macular degeneration is
characterized by macular dystrophy, various alterations of the peripheral retina, central
visual impairment, progressive bilateral atrophy of the macular retinal pigment epithelium
(RPE) and neuroepithelium, frequent appearance of orange-yellow flecks distributed around
the macula and/or the midretinal periphery, and subretinal deposition of lipofuscin-like
material. Retinal and macular degenerative diseases include and are not limited to
Stargardt Disease, Fundus Flavimaculatus, age-related macular degeneration, and may
include disorders variously called retinitis pigmentosa, combined rod and cone
dystrophies, cone dystrophies and degenerations, pattern dystrophy, bull's eye
maculopathies, and various other retinal degenerative disorders, some induced by drugs,
toxins, environmental influences, and the like. Stargardt Disease is an autosomal
recessive retinal disorder characterized by juvenile to adult-onset macular and retinal
dystrophy. Fundus Flavimaculatus often displays later age of onset and slower progression.
Some environmental insults and drug toxicities may create similar retinal degenerations.
Linkage analysis reveals that Stargardt Disease and Fundus Flavimaculatus may be allelic
autosomal recessive disorders with slightly different clinical manifestations. The
identification of the ABCR gene suggests that different mutations within ABCR may be
responsible for these clinical phenomena.

The present invention is also directed to a method of screening for an agent that modifies
ATP binding cassette transporter comprising combining purified ATP binding cassette
transporter with an agent suspected of modifying ATP binding cassette transporter and
observing a change in at least one characteristic associated with ATP binding cassette
transporter.

"Modify" and variations thereof include changes such as and not limited to inhibit,
suppress, delay, retard, slow, suspend, obstruct, and restrict, as well as induce,
encourage, provoke, and cause. Modify may also be defined as complete inhibition such that
macular degeneration is arrested, stopped, or blocked. Modifications may, directly or
indirectly, inhibit or substantially inhibit macular degeneration, or induce or
substantially induce macular degeneration, under certain circumstances.

Methods of identifying an agent that inhibits macular degeneration are embodied by the
present invention and comprise combining purified ATP binding cassette transporter from a
patient suspected of having macular degeneration and an agent suspected of interacting
with the ATP binding cassette transporter and observing an inhibition in at least one of
the characteristics of diseases associated with the ATP binding cassette transporter.
Accordingly, such methods serve to reduce or prevent macular degeneration, such as in
human patients. In addition, the present invention provides for methods of identifying an
agent that induces onset of at least one characteristic associated with ATP binding
cassette transporter comprising combining purified wild-type ATP binding cassette
transporter with an agent suspected of inducing a macular degenerative disease and
observing the onset of a characteristic associated with macular degeneration. Thus, such
methods provide a means of using laboratory animals to determine causative agents of
macular degeneration. The ATP binding cassette transporter may be provided for in the
methods identified herein in the form of nucleic acids, such as and not limited to SEQ ID
NOS: 1, 2, and 5, or as amino acid sequences, SEQ ID NOS: 3 and 6, for example.
Accordingly, transcription and translation inhibitors may be separately identified.
Characteristics associated with macular degeneration include and are not limited to
central visual impairment, progressive bilateral atrophy of the macular retinal pigment
epithelium (RPE) and neuroepithelium, and the frequent appearance of orange-yellow flecks
distributed around the macula and/or the midretinal periphery. Accordingly, observing one
or more of the characteristics set forth above results in identification of an agent that
induces macular degeneration, whereas reduction or inhibition of at least one of the
characteristics results in identification of an agent that inhibits macular degeneration.
Mutational analysis of ABCR in Stargardt Disease families revealed thus far seventy-four
mutations including fifty-four single amino acid substitutions, five nonsense mutations
resulting in early truncation of the protein, six frameshift mutations resulting in early
truncation of the protein, three in-frame deletions resulting in loss of amino acid
residues from the protein, and six splice site mutations resulting in incorrect processing
of the nascent RNA transcript, see Table 2. Compound heterozygotes for mutations in ABCR
were found in forty-two families. Homozygous mutations were identified in three families
with consanguineous parentage. Accordingly, mutations in wild-type ABCR which result in
activities that are not associated with wild-type ABCR are herein referred to as sequences
which are associated with macular degeneration. Such mutations include missense mutations,
deletions, insertions, substantial differences in hydrophobicity, hydrophilicity, acidity,
and basicity. Characteristics which are associated with retinal or macular degeneration
include and are not limited to those characteristics set forth above.
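
The tally of mutation classes given above can be checked with a short calculation. The
Python sketch below uses only the counts stated in the text; the dictionary keys are
shorthand labels chosen for illustration.

```python
# Counts of ABCR mutation classes as stated in the text above.
mutation_counts = {
    "single amino acid substitution": 54,
    "nonsense": 5,
    "frameshift": 6,
    "in-frame deletion": 3,
    "splice site": 6,
}

total = sum(mutation_counts.values())
truncating = mutation_counts["nonsense"] + mutation_counts["frameshift"]

print(total)       # total number of mutations across all classes
print(truncating)  # mutations described as causing early truncation
```

The five classes sum to the seventy-four mutations reported, of which eleven are described
as resulting in early truncation of the protein.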

Mutations in wild-type ABCR provide a method of detecting macular degeneration. Retinal or
macular degeneration may be detected by obtaining a sample comprising patient nucleic
acids from a patient tissue sample; amplifying retina-specific ATP binding cassette
receptor specific nucleic acids from the patient nucleic acids to produce a test fragment;
obtaining a sample comprising control nucleic acids from a control tissue sample;
amplifying control nucleic acids encoding wild-type retina-specific ATP binding cassette
receptor to produce a control fragment; and comparing the test fragment with the control
fragment to detect the presence of a sequence difference in the test fragment, wherein a
difference in the test fragment indicates macular degeneration. Mutations in the test
fragment, including and not limited to each of the mutations identified above, may provide
evidence of macular degeneration.
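
The comparison step of the detection method above can be sketched in Python once the test
and control fragments have been sequenced. This is a minimal illustration, assuming the
two fragments are the same length and already aligned, so that only substitutions are
reported; insertions and deletions would require a proper alignment step. The function
name and the example sequences are hypothetical and are not taken from the ABCR sequences.

```python
def sequence_differences(control: str, test: str):
    """List (1-based position, control base, test base) for each mismatch.

    Assumes equal-length, pre-aligned fragments (substitutions only).
    """
    if len(control) != len(test):
        raise ValueError("fragments differ in length; align them first")
    return [
        (i + 1, c, t)
        for i, (c, t) in enumerate(zip(control.upper(), test.upper()))
        if c != t
    ]


# Hypothetical fragments for illustration only.
control_fragment = "ATGCCGTA"
test_fragment = "ATGCTGTA"
print(sequence_differences(control_fragment, test_fragment))
```

An empty list would indicate no detected difference between the test and control
fragments under these assumptions.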

A purified ABCR protein is also provided by the present invention. The purified ABCR
protein may have an amino acid sequence as provided by SEQ ID NOS: 3 and 6.

The present invention is directed to ABCR sequences obtained from mammals from the Order
Rodentia, including and not limited to hamsters, rats, and mice; Order Lagomorpha, such as
rabbits; more particularly the Order Carnivora, including Felines (cats) and Canines
(dogs); even more particularly the Order Artiodactyla, Bovines (cows) and Suines (pigs);
and the Order Perissodactyla, including Equines (horses); and most particularly the Order
Primates, Ceboids and Simoids (monkeys) and Anthropoids (humans and apes). The mammals of
most preferred embodiments are humans.

Generally, the sequences of the invention may be produced in host cells transformed with
an expression vector comprising a nucleic acid sequence encoding ABCR. The transformed
cells are cultured under conditions whereby the nucleic acid sequence coding for ABCR is
expressed. After a suitable amount of time for the protein to accumulate, the protein may
be purified from the transformed cells.

A gene coding for ABCR may be obtained from a cDNA library. Suitable libraries can be
obtained from commercial sources such as Clontech, Palo Alto, CA. Libraries may also be
prepared using the following non-limiting examples: hamster insulin-secreting tumor (HIT),
mouse αTC-6, and rat insulinoma (RIN) cells. Positive clones are then subjected to DNA
sequencing to determine the presence of a DNA sequence coding for ABCR. DNA sequencing is
accomplished using the chain termination method of Sanger et al., Proc. Natl. Acad. Sci.
USA, 1977, 74, 5463. The DNA sequence encoding ABCR is then inserted into an expression
vector for later expression in a host cell.

Expression vectors and host cells are selected to form an expression system capable of
synthesizing ABCR. Vectors including and not limited to baculovirus vectors may be used in
the present invention. Host cells suitable for use in the invention include prokaryotic
and eukaryotic cells that can be transformed to stably contain and express ABCR. For
example, nucleic acids coding for the recombinant protein may be expressed in prokaryotic
or eukaryotic host cells, including the most commonly used bacterial host cell for the
production of recombinant proteins, E. coli. Other microbial strains may also be used,
however, such as Bacillus subtilis, and other enterobacteriaceae such as Salmonella
typhimurium or Serratia marcescens, various species of Pseudomonas, or other bacterial
strains.

The preferable eukaryotic system is yeast, such as Saccharomyces cerevisiae. Yeast
artificial chromosome (YAC) systems are able to accommodate the large size of the ABCR
gene sequence or genomic clone. The principle of the YAC system is similar to that used in
conventional cloning of DNA. Large fragments of cDNA are ligated into two "arms" of a YAC
vector, and the ligation mixture is then introduced into the yeast by transformation. Each
of the arms of the YAC vector carries a selectable marker as well as appropriately
oriented sequences that function as telomeres in yeast. In addition, one of the two arms
carries two small fragments that function as a centromere and as an origin of replication
(also called an ARS element, for autonomously replicating sequence). Yeast transformants
that have taken up and stably maintained an artificial chromosome are identified as
colonies on agar plates containing the components necessary for selection of one or both
YAC arms. YAC vectors are designed to allow rapid identification of transformants that
carry inserts of genomic DNA. Insertion of genomic DNA into the cloning site interrupts a
suppressor tRNA gene and results in the formation of red rather than white colonies by
yeast strains that carry an amber adel gene.

To clone in YAC vectors, genomic DNA from the test organism is prepared under conditions
that result in relatively little shearing such that its average size is several million
base pairs. The cDNA is then ligated to the arms of the YAC vector, which has been
appropriately prepared to prevent self-ligation. As an alternative to partial digestion
with EcoRI, YAC vectors may be used that will accept genomic DNA that has been digested to
completion with rarely cutting restriction enzymes such as NotI or MluI.

In addition, insect cells, such as Spodoptera frugiperda; chicken cells, such as E3C/0 and
SL-29; mammalian cells, such as HeLa, Chinese hamster ovary cells (CHO), COS-7 or MDCK
cells and the like may also be used. The foregoing list is illustrative only and is not
intended in any way to limit the types of host cells suitable for expression of the
nucleic acid sequences of the invention.

As used herein, expression vectors refer to any type of vector that can be manipulated to
contain a nucleic acid sequence coding for ABCR, such as plasmid expression vectors, viral
vectors, and yeast expression vectors. The selection of the expression vector is based on
compatibility with the desired host cell such that expression of the nucleic acid encoding
ABCR results. Plasmid expression vectors comprise a nucleic acid sequence of the invention
operably linked with at least one expression control element such as a promoter. In
general, plasmid vectors contain replicon and control sequences derived from species
compatible with the host cell. To facilitate selection of plasmids containing nucleic acid
sequences of the invention, plasmid vectors may also contain a selectable marker such as a
gene coding for antibiotic resistance. Suitable examples include the genes coding for
ampicillin, tetracycline, chloramphenicol, or kanamycin resistance.

Suitable expression vectors, promoters, enhancers, and other expression control elements
are known in the art and may be found in Sambrook et al., Molecular Cloning: A Laboratory
Manual, second edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY
(1989), incorporated herein by reference in its entirety.

Transformed host cells containing a DNA sequence encoding ABCR may then be grown in an
appropriate medium for the host. The cells are grown until product accumulation reaches
desired levels, at which time the cells are harvested and the protein product purified in
accordance with conventional techniques. Suitable purification methods include, but are
not limited to, SDS PAGE electrophoresis, phenylboronate-agarose, reactive green
19-agarose, concanavalin A sepharose, ion exchange chromatography, affinity
chromatography, electrophoresis, dialysis and other methods of purification known in the
art.

Protein preparations of purified or unpurified ABCR produced by host cells accordingly
comprise ABCR and other material such as host cell components and/or cell medium,
depending on the degree of purification of the protein.

The invention also includes a transgenic non-human animal, including and not limited to
mammals, such as and not limited to a mouse, rat, or hamster, comprising a sequence
encoding ABCR, or fragment thereof that substantially retains ABCR activity, introduced
into the animal or an ancestor of the animal. The sequence may be wild-type or mutant and
may be introduced into the animal at the embryonic or adult stage. The sequence is
incorporated into the genome of an animal such that it is chromosomally incorporated in an
activated state. A transgenic non-human animal has germ cells and somatic cells that
contain an ABCR sequence. Embryo cells may be transfected with the gene as it occurs
naturally, and transgenic animals are selected in which the gene has integrated into the
chromosome at a locus which results in activation. Other activation methods include
modifying the gene or its control sequences prior to introduction into the embryo. The
embryo may be transfected using a vector containing the gene.

In addition, a transgenic non-human animal may be engineered wherein ABCR is suppressed.
For purposes of the present invention, suppression of ABCR includes, and is not limited
to, strategies which cause ABCR not to be expressed. Such strategies may include and are
not limited to inhibition of protein synthesis, pre-mRNA processing, or DNA replication.
Each of the above strategies may be accomplished by antisense inhibition of ABCR gene
expression. Many techniques for transferring antisense sequences into cells are known to
those of skill, including and not limited to microinjection, viral-mediated transfer,
somatic cell transformation, transgene integration, and the like, as set forth in Pinkert,
Carl, Transgenic Animal Technology, 1994, Academic Press, Inc., San Diego, CA,
incorporated herein by reference in its entirety.

Further, a transgenic non-human animal may be prepared such that ABCR is 
knocked out. For purposes of the present invention, a knock-out includes and is not limited
to disruption or rendering null the ABCR gene. A knock-out may be accomplished, for
example, with antisense sequences for ABCR. The ABCR gene may be knocked out by
injection of an antisense sequence for all or part of the ABCR sequence such as an antisense
sequence for all or part of SEQ ID NO: 2. Once ABCR has been rendered null, correlation of
the ABCR to macular degeneration may be tested. Sequences encoding mutations affecting
the ABCR may be inserted to test for alterations in various retinal and macular degenerations
exhibited by changes in the characteristics associated with retinal and macular degeneration.

An ABCR knock-out may be engineered by inserting synthetic DNA into the 
animal chromosome by homologous recombination. In this method, sequences flanking the 
target and insert DNA are identical, allowing strand exchange and crossing over to occur 
between the target and insert DNA. Sequences to be inserted typically include a gene for a 
selectable marker, such as drug resistance. Sequences to be targeted are typically coding
regions of the genome, in this case part of the ABCR gene. In this process of homologous 
recombination, targeted sequences are replaced with insert sequences thus disrupting the 
targeted gene and rendering it nonfunctional. This nonfunctional gene is called a null allele 
of the gene. 

To create the knockout mouse, a DNA construct containing the insert DNA and
flanking sequences is made. This DNA construct is transfected into pluripotent embryonic
stem cells competent for recombination. The identical flanking sequences align with one
another, and chromosomal recombination occurs in which the targeted sequence is replaced
with the insert sequence, as described in Bradley, A., Production and Analysis of Chimeric
Mice, in Teratocarcinomas and Embryonic Stem Cells - A Practical Approach, 1987, E.
Robertson, Editor, IRL Press, pages 113-151. The stem cells are injected into an embryo,
which is then implanted into a female animal and allowed to be born. The animals may 
contain germ cells derived from the injected stem cells, and subsequent matings may produce 
animals heterozygous and homozygous for the disrupted gene. 

Transgenic non-human animals may also be useful for testing nucleic acid
changes to identify additional mutations responsible for macular degeneration. A transgenic
non-human animal may comprise a recombinant ABCR. 

The present invention is also directed to gene therapy. For purposes of the 
present invention, gene therapy refers to the transfer and stable insertion of new genetic
information into cells for the therapeutic treatment of diseases or disorders. A foreign
sequence or gene is transferred into a cell that proliferates to spread the new sequence or gene
throughout the cell population. Sequences include antisense sequence of all or part of ABCR,
such as an antisense sequence to all or part of the sequences identified as SEQ ID NO: 1, 2,
and 5. Known methods of gene transfer include microinjection, electroporation, liposomes,
chromosome transfer, transfection techniques, calcium-precipitation transfection techniques,
and the like. In the instant case, macular degeneration may result from a loss of gene function,
as a result of a mutation for example, or a gain of gene function, as a result of an extra copy
of a gene, such as three copies of a wild-type gene, or a gene overexpressed as a result of a
mutation in a promoter, for example. Expression may be altered by activating or deactivating
regulatory elements, such as a promoter. A mutation may be corrected by replacing the
mutated sequence with a wild-type sequence or inserting an antisense sequence to bind to an
overexpressed sequence or to a regulatory sequence.

Numerous techniques are known in the art for the introduction of foreign genes 
into cells and may be used to construct the recombinant cells for purposes of gene therapy, in 
accordance with this embodiment of the invention. The technique used should provide for the
stable transfer of the heterologous gene sequence to the stem cell, so that the heterologous
gene sequence is heritable and expressible by stem cell progeny, and so that the necessary
development and physiological functions of the recipient cells are not disrupted. Techniques
which may be used include but are not limited to chromosome transfer (e.g., cell fusion,
chromosome-mediated gene transfer, micro cell-mediated gene transfer), physical methods
(e.g., transfection, spheroplast fusion, microinjection, electroporation, liposome carrier), viral
vector transfer (e.g., recombinant DNA viruses, recombinant RNA viruses) and the like
(described in Cline, M. J., 1985, Pharmac. Ther. 29:69-92, incorporated herein by reference 
in its entirety). 

The term "purified", when used to describe the state of nucleic acid sequences 
of the invention, refers to nucleic acid sequences substantially free of nucleic acid not coding
for ABCR or other materials normally associated with nucleic acid in non-recombinant cells, 
i.e., in its "native state." 

The term "purified" or "in purified form" when used to describe the state of an 
ABCR nucleic acid, protein, polypeptide, or amino acid sequence, refers to sequences
substantially free, to at least some degree, of cellular material or other material normally
associated with it in its native state. Preferably the sequence has a purity (homogeneity) of
at least about 25% to about 100%. More preferably the purity is at least about 50%, when
purified in accordance with standard techniques known in the art. 

In accordance with methods of the present invention, methods of detecting 
retinal or macular degenerations in a patient are provided comprising obtaining a patient tissue 
sample for testing. The tissue sample may be solid or liquid, or a body fluid sample, such as and
not limited to blood, skin, serum, saliva, sputum, mucus, bone marrow, urine, lymph, a
tear, and feces. In addition, a tissue sample from amniotic fluid or chorion may be provided
for the detection of retinal or macular degeneration in utero in accordance with the present 
invention. 

A test fragment is defined herein as an amplified sample comprising ABCR-specific
nucleic acids from a patient suspected of having retinal or macular degeneration. A
control fragment is an amplified sample comprising normal or wild-type ABCR-specific
nucleic acids from an individual not suspected of having retinal or macular degeneration. 

The method of amplifying nucleic acids may be the polymerase chain reaction
using a pair of primers wherein at least one primer within the pair is selected from the group
consisting of SEQ ID NOS: 12-113. When the polymerase chain reaction is the amplification
method of choice, a pair of primers may be used such that one primer of the pair is selected
from the group consisting of SEQ ID NOS: 12-113.

Nucleic acids, such as DNA (such as and not limited to genomic DNA and
cDNA) and/or RNA (such as and not limited to mRNA), are obtained from the patient sample.
Preferably RNA is obtained. 

Nucleic acid extraction is followed by amplification of the same by any 
technique known in the art. The amplification step includes the use of at least one primer 
sequence which is complementary to a portion of ABCR-specific expressed nucleic acids or
sequences on flanking intronic genomic sequences in order to amplify exon or coding
sequences. Primer sequences useful in the amplification methods include and are not limited
to SEQ ID NOS: 12-113. Any primer
sequence of about 10 nucleotides to about 35 nucleotides, more preferably about 15
nucleotides to about 30 nucleotides, even more preferably about 17 nucleotides to about 25
nucleotides may be useful in the amplification step of the methods of the present invention.
In addition, mismatches within the sequences identified above, which achieve the methods of
the invention, such that the mismatched sequences are substantially complementary and thus
hybridizable to the sequence sought to be identified, are also considered within the scope of
the disclosure. Mismatches which permit substantial similarity to SEQ ID NOS: 12-113, such
as and not limited to sequences with similar hydrophobicity, hydrophilicity, basicity, and
acidity, will be known to those of skill in the art once armed with the present disclosure. The
primers may also be unmodified or modified. Primers may be prepared by any method known
in the art such as by standard phosphoramidite chemistry. See Sambrook et al., supra.
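
By way of illustration only, the nested primer length ranges recited above may be sketched in code. The function name, the tier labels, and the example sequence are hypothetical and are not taken from SEQ ID NOS: 12-113. Because each range is qualified by "about", the hard cutoffs used here are a simplifying assumption.

```python
def primer_length_tier(primer: str) -> str:
    """Classify a primer by length against the ranges recited in the text.

    Assumption: "about N nucleotides" is treated as exactly N, which is
    stricter than the text requires. The tier labels are illustrative.
    """
    length = len(primer.replace(" ", ""))  # ignore spacing between triplets
    if 17 <= length <= 25:
        return "even more preferred"
    if 15 <= length <= 30:
        return "more preferred"
    if 10 <= length <= 35:
        return "useful"
    return "outside stated range"


# Hypothetical 21-nucleotide primer written in spaced triplets.
example_primer = "ATG CAT GCA TGC ATG CAT GCA"
tier = primer_length_tier(example_primer)
```

The ranges are tested from narrowest to widest so that a primer is reported under the most preferred range it satisfies.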

When an amplification method includes the use of two primers, a first primer 
and a second primer, such as in the polymerase chain reaction, one of the first primer or
second primer may be selected from the group consisting of SEQ ID NOS: 12-113. Any
primer pairs which copy and amplify nucleic acids between the pairs pointed toward each 
other and which are specific for ABCR may be used in accordance with the methods of the 
present invention. 

A number of template dependent processes are available to amplify the target 
sequences of interest present in a sample. One of the best known amplification methods is the
polymerase chain reaction (PCR) which is described in detail in U.S. Patents 4,683,195,
4,683,202 and 4,800,159, and in Innis et al., PCR Protocols, Academic Press, Inc., San Diego
CA, 1990, each of which is incorporated herein by reference in its entirety. Briefly, in PCR,
two primer sequences are prepared which are complementary to regions on opposite
complementary strands of the target sequence. An excess of deoxynucleoside triphosphates
is added to a reaction mixture along with a DNA polymerase (e.g., Taq polymerase). If the
target sequence is present in a sample, the primers will bind to the target and the polymerase
will cause the primers to be extended along the target sequence by adding on nucleotides. By
raising and lowering the temperature of the reaction mixture, the extended primers will
dissociate from the target to form reaction products, excess primers will bind to the target and
to the reaction products and the process is repeated. Alternatively, a reverse transcriptase PCR
amplification procedure may be performed in order to quantify the amount of mRNA
amplified. Polymerase chain reaction methodologies are well known in the art. 

Another method for amplification is the ligase chain reaction (referred to as 
LCR), disclosed in EPA No. 320,308, incorporated herein by reference in its entirety. In 
LCR, two complementary probe pairs are prepared, and in the presence of the target sequence,
each pair will bind to opposite complementary strands of the target such that they abut. In the 
presence of a ligase, the two probe pairs will link to form a single unit. By temperature 
cycling, as in PCR, bound ligated units dissociate from the target and then serve as "target 
sequences" for ligation of excess probe pairs. U.S. Patent 4,883,750, incorporated herein by 
reference in its entirety, describes an alternative method of amplification similar to LCR for
binding probe pairs to a target sequence. 

Qbeta Replicase, described in PCT Application No. PCT/US87/00880, 
incorporated herein by reference in its entirety, may also be used as still another amplification 
method in the present invention. In this method, a replicative sequence of RNA which has a 
region complementary to that of a target is added to a sample in the presence of an RNA
polymerase. The polymerase will copy the replicative sequence which can then be detected. 

An isothermal amplification method, in which restriction endonucleases and 
ligases are used to achieve the amplification of target molecules that contain nucleotide
5'-[alpha-thio]triphosphates in one strand of a restriction site (Walker, G. T., et al., Proc. Natl.
Acad. Sci. (U.S.A.) 1992, 89:392-396, incorporated herein by reference in its entirety), may
also be useful in the amplification of nucleic acids in the present invention. 

Strand Displacement Amplification (SDA) is another method of carrying out 
isothermal amplification of nucleic acids which involves multiple rounds of strand 
displacement and synthesis, i.e. nick translation. A similar method, called Repair Chain 
Reaction (RCR) is another method of amplification which may be useful in the present
invention and which involves annealing several probes throughout a region targeted for
amplification, followed by a repair reaction in which only two of the four bases are present.
The other two bases can be added as biotinylated derivatives for easy detection. A similar
approach is used in SDA.

ABCR-specific nucleic acids can also be detected using a cyclic probe reaction
(CPR). In CPR, a probe having 3' and 5' sequences of non-ABCR specific DNA and a middle
sequence of ABCR specific RNA is hybridized to DNA which is present in a sample. Upon
hybridization, the reaction is treated with RNaseH, and the products of the probe, identified
as distinctive products, generate a signal which is released after digestion. The original
template is annealed to another cycling probe and the reaction is repeated. Thus, CPR
involves amplifying a signal generated by hybridization of a probe to an ABCR-specific
expressed nucleic acid. 

Still other amplification methods described in GB Application No. 2 202 328, 
and in PCT Application No. PCT/US89/01025, each of which is incorporated by reference in 
its entirety, may be used in accordance with the present invention. In the former application,
"modified" primers are used in a PCR-like, template- and enzyme-dependent synthesis. The
primers may be modified by labeling with a capture moiety (e.g., biotin) and/or a detector
moiety (e.g., enzyme). In the latter application, an excess of labeled probes is added to a
sample. In the presence of the target sequence, the probe binds and is cleaved catalytically.
After cleavage, the target sequence is released intact to be bound by excess probe. Cleavage
of the labeled probe signals the presence of the target sequence.

Other nucleic acid amplification procedures include transcription-based 
amplification systems (TAS) (Kwoh D., et al., Proc. Natl. Acad. Sci. (U.S.A.) 1989, 86:1173,
Gingeras T. R., et al., PCT Application WO 88/10315, each of which is incorporated herein
by reference in its entirety), including nucleic acid sequence based amplification (NASBA)
and 3SR. In NASBA, the nucleic acids can be prepared for amplification by standard
phenol/chloroform extraction, heat denaturation of a clinical sample, treatment with lysis
buffer and minispin columns for isolation of DNA and RNA or guanidinium chloride
extraction of RNA. These amplification techniques involve annealing a primer which has
ABCR-specific sequences. Following polymerization, DNA/RNA hybrids are digested with
RNase H while double stranded DNA molecules are heat denatured again. In either case the
single stranded DNA is made fully double stranded by addition of a second ABCR-specific
primer, followed by polymerization. The double stranded DNA molecules are then multiply
transcribed by a polymerase such as T7 or SP6. In an isothermal cyclic reaction, the RNAs
are reverse transcribed into double stranded DNA, and transcribed once again with a
polymerase such as T7 or SP6. The resulting products, whether truncated or complete,
indicate ABCR-specific sequences.

Davey, C., et al., European Patent Application Publication No. 329,822,
incorporated herein by reference in its entirety, disclose a nucleic acid amplification process 
involving cyclically synthesizing single-stranded RNA ("ssRNA"), ssDNA, and 
double-stranded DNA ("dsDNA") which may be used in accordance with the present 
invention. The ssRNA is a first template for a first primer oligonucleotide, which is elongated 
by reverse transcriptase (RNA-dependent DNA polymerase). The RNA is then removed from 
the resulting DNA:RNA duplex by the action of ribonuclease H (RNase H, an RNase specific for
RNA in a duplex with either DNA or RNA). The resultant ssDNA is a second template for 
a second primer, which also includes the sequences of an RNA polymerase promoter 
(exemplified by T7 RNA polymerase) 5' to its homology to its template. This primer is then 
extended by DNA polymerase (exemplified by the large "Klenow" fragment of E. coli DNA 
polymerase I), resulting in a double-stranded DNA ("dsDNA") molecule, having a sequence
identical to that of the original RNA between the primers and having additionally, at one end, 
a promoter sequence. This promoter sequence can be used by the appropriate RNA 
polymerase to make many RNA copies of the DNA. These copies can then re-enter the cycle 
leading to very swift amplification. With proper choice of enzymes, this amplification can 
be done isothermally without addition of enzymes at each cycle. Because of the cyclical 
nature of this process, the starting sequence can be chosen to be in the form of either DNA or 
RNA. 

Miller, H. I., et al., PCT application WO 89/06700, incorporated herein by
reference in its entirety, disclose a nucleic acid sequence amplification scheme based on the 
hybridization of a promoter/primer sequence to a target single-stranded DNA ("ssDNA") 
followed by transcription of many RNA copies of the sequence. This scheme is not cyclic; 
i.e., new templates are not produced from the resultant RNA transcripts. Other amplification
methods include "RACE" (disclosed by Frohman, M. A., In: PCR Protocols: A Guide to Methods
and Applications, 1990, Academic Press, N.Y.) and "one-sided PCR" (Ohara, O., et al., Proc.
Natl. Acad. Sci. (U.S.A.) 1989, 86:5673-5677), all references herein incorporated by reference
in their entirety. 

Methods based on ligation of two (or more) oligonucleotides in the presence 
of nucleic acid having the sequence of the resulting "di-oligonucleotide", thereby amplifying
the di-oligonucleotide (Wu, D. Y. et al., Genomics 1989, 4:560, incorporated herein by
reference in its entirety), may also be used in the amplification step of the present invention. 

Test fragment and control fragment may be amplified by any amplification 
methods known to those of skill in the art, including and not limited to the amplification 
methods set forth above. For purposes of the present invention, amplification of sequences
encoding patient and wild-type ABCR includes amplification of a portion of a sequence such
as and not limited to a portion of an ABCR sequence of SEQ ID NO: 1, such as a sequence of
a length of about 10 nucleotides to about 1,000 nucleotides, more preferably about 10
nucleotides to about 100 nucleotides, or having at least 10 nucleotides occurring anywhere
within SEQ ID NO: 1, where sequence differences are known to occur within ABCR test
fragments. Thus, for example, a portion of the sequence encoding ABCR of a patient sample 
and a control sample may be amplified to detect sequence differences between these two 
sequences. 

Following amplification of the test fragment and control fragment, comparison 
between the amplification products of the test fragment and control fragment is carried out.
Sequence changes such as and not limited to nucleic acid transition, transversion, and 
restriction digest pattern alterations may be detected by comparison of the test fragment with 
the control fragment. 
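
As a hedged illustration of the comparison step, single-base differences between a control fragment and a test fragment may be classified as transitions (purine to purine, or pyrimidine to pyrimidine) or transversions (purine to pyrimidine, or the reverse). The function names and sequences below are hypothetical. The sketch assumes the two fragments are already aligned and of equal length, and it does not handle insertions, deletions, or restriction digest patterns.

```python
PURINES = frozenset("AG")
PYRIMIDINES = frozenset("CT")


def classify_substitution(ref: str, alt: str) -> str:
    """Classify a single-base change as a transition or a transversion."""
    ref, alt = ref.upper(), alt.upper()
    if ref not in "ACGT" or alt not in "ACGT" or len(ref) != 1 or len(alt) != 1:
        raise ValueError("bases must be single characters from A, C, G, T")
    if ref == alt:
        raise ValueError("ref and alt are identical; no substitution")
    pair = {ref, alt}
    same_class = pair <= PURINES or pair <= PYRIMIDINES
    return "transition" if same_class else "transversion"


def compare_fragments(control: str, test: str):
    """List (1-based position, control base, test base, class) per mismatch.

    Assumption: the fragments are pre-aligned and equal in length.
    """
    if len(control) != len(test):
        raise ValueError("fragments must be aligned to equal length")
    differences = []
    for position, (c, t) in enumerate(zip(control.upper(), test.upper()), start=1):
        if c != t:
            differences.append((position, c, t, classify_substitution(c, t)))
    return differences


# Hypothetical control (wild-type) and test (patient) fragments.
changes = compare_fragments("ATGCCGTA", "ATACCGTC")
```

In this example the G to A change at position 3 is a transition and the A to C change at position 8 is a transversion.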

Alternatively, the presence or absence of the amplification product may be
detected. The nucleic acids are fragmented into varying sizes of discrete fragments. For
example, DNA fragments may be separated according to molecular weight by methods such
as and not limited to electrophoresis through an agarose gel matrix. The gels are then
analyzed by Southern hybridization. Briefly, DNA in the gel is transferred to a hybridization
substrate or matrix such as and not limited to a nitrocellulose sheet and a nylon membrane.
A labeled probe encoding an ABCR mutation is applied to the matrix under selected
hybridization conditions so as to hybridize with complementary DNA localized on the matrix.
The probe may be of a length capable of forming a stable duplex. The probe may have a size
range of about 200 to about 10,000 nucleotides in length, preferably about 500 nucleotides in
length, and more preferably about 2,454 nucleotides in length. Mismatches which permit
substantial similarity to the probe, such as and not limited to sequences with similar
hydrophobicity, hydrophilicity, basicity, and acidity, will be known to those of skill in the art
once armed with the present disclosure. Various labels for visualization or detection are
known to those of skill in the art, such as and not limited to fluorescent staining, ethidium
bromide staining for example, avidin/biotin, radioactive labeling such as 32P labeling, and the
like. Preferably, the product, such as the PCR product, may be run on an agarose gel and
visualized using a stain such as ethidium bromide. See Sambrook et al., supra. The matrix
may then be analyzed by autoradiography to locate particular fragments which hybridize to
the probe. Yet another alternative is the sequencing of the test fragment and the control
fragment to identify sequence differences. Methods of nucleic acid sequencing are known to
those of skill in the art, including and not limited to the methods of Maxam and Gilbert, Proc.
Natl. Acad. Sci. USA 1977, 74, 560-564 and Sanger, Proc. Natl. Acad. Sci. USA 1977, 74,
5463-5467. 

A pharmaceutical composition comprising all or part of a sequence for ABCR 
may be delivered to a patient suspected of having retinal or macular degeneration. The 
sequence may be an antisense sequence. The composition of the present invention may be
administered alone or may generally be administered in admixture with a pharmaceutical
carrier. The pharmaceutically-acceptable carrier may be selected with regard to the intended
route of administration and the standard pharmaceutical practice. The dosage will be about
that of the sequence alone and will be set with regard to weight, and clinical condition of the
patient. The proportional ratio of active ingredient to carrier will naturally depend, inter alia,
on the chemical nature, solubility, and stability of the sequence, as well as the dosage
contemplated. 

The sequences of the invention may be employed in the method of the 
invention singly or in combination with other compounds, including and not limited to other 
sequences set forth in the present invention. The method of the invention may also be used
in conjunction with other treatments such as and not limited to antibodies, for example. For
in vivo applications the amount to be administered will also depend on such factors as the age,
weight, and clinical condition of the patient. The composition of the present invention may
be administered by any suitable route, including as an eye drop, inoculation and injection, for
example, intravenous, intraocular, oral, intraperitoneal, intramuscular, subcutaneous, topically,
and by absorption through epithelial or mucocutaneous linings, for example, conjunctival,
nasal, oral, vaginal, rectal and gastrointestinal.

The mode of administration of the composition may determine the sites in the
organism to which the compound will be delivered. For instance, topical application may be 
administered in creams, ointments, gels, oils, emulsions, pastes, lotions, and the like. For 
parenteral administration, the composition may be used in the form of sterile aqueous or
non-aqueous solution which may contain another solute, for example, sufficient salts, glucose or
dextrose to make the solution isotonic. A non-aqueous solution may comprise an oil, for
example. For oral mode of administration, the present invention may be used in the form of
tablets, capsules, lozenges, troches, powders, syrups, elixirs, aqueous solutions and
suspensions, and the like. Various disintegrants, such as starch, and lubricating agents may be
used. For oral administration in capsule form, useful diluents are lactose and high molecular
weight polyethylene glycols. When aqueous suspensions are required for oral use, certain 
sweetening and/or flavoring agents may be added. 

A diagnostic kit for detecting retinal or macular degeneration comprising in 
one or more containers at least one primer which is complementary to an ABCR sequence and
a means for visualizing amplified DNA is also within the scope of the present invention.
Alternatively, the kit may comprise two primers. In either case, the primers may be selected
from the group consisting of SEQ ID NOS: 12-113, for example. The diagnostic kit may
comprise a pair of primers wherein one primer within said pair is complementary to a region
of the ABCR gene, wherein one of said pair of primers is selected from the group consisting
of SEQ ID NOS: 12-113, a probe specific to the amplified product, and a means for visualizing
amplified DNA, and optionally including one or more size markers, and positive and negative
controls. The diagnostic kit of the present invention may comprise one or more of a
fluorescent dye such as ethidium bromide stain, 32P, and biotin, as a means for visualizing or
detecting amplified DNA. Optionally the kit may include one or more size markers, positive
and negative controls, restriction enzymes, and/or a probe specific to the amplified product.

The following examples are illustrative but are not meant to be limiting of the
invention.

EXAMPLES:

Identification of the ABCR as a Candidate Gene for STGD 

One of the 21 new human genes from the ABC superfamily, hereafter called 
ABCR (retina-specific ABC transporter), was identified (Allikmets et al., 1996) among
expressed sequence tags (ESTs) obtained from 5,000 human retina cDNA clones (Wang, Y.,
Macke, J.P., Abella, B.S., Andreasson, K., Worley, P., Gilbert, D.J., Copeland, N.G., Jenkins,
N.A., and Nathans, J. (1996)) and among ESTs obtained from human retina cDNA clones by
the I.M.A.G.E. consortium (Lennon et al., 1996). ABCR is closely related to the previously
described mouse and human ABC1 and ABC2 genes (Luciani et al., 1994; Allikmets et al.,
1995). To determine whether ABCR might cause a disease, the gene was mapped with a
whole genome radiation hybrid panel (GeneBridge 4; Research Genetics, Huntsville, AL).
ABCR mapped to the human chromosome 1p13-p21 region, close to microsatellite markers
D1S236 and D1S188. To define further the location of the gene, PCR primers, 3'UTR-For
5'TCCTCTGACTCAGCAATCACA, SEQ ID NO: 7, and 3'UTR-Rev
5'TTGCAATTACAAATGCAATGG, SEQ ID NO: 8, from the putative 3' untranslated region
were used to screen YACs from the previously described contig between these anonymous
markers (Anderson et al., 1995). At least 12 YACs contain the 3' end of the ABCR gene,
including 924_e_9, 759_d_7, 775_c_2, 782_b_4, 982_g_5, 775_b_2, 765_a_3, 751_f_2,
848_e_3, 943_h_8, 934_g_7, and 944_b_12 (Figure 1). These YACs delineate a region
containing the STGD gene between markers D1S3361 and D1S236 (Anderson et al., 1995).

Expression of the ABCR Gene

Additional support suggesting that ABCR is a candidate STGD gene came from 
expression studies and inspection of the EST databases. 

Searches of the dbEST (Boguski et al., 1993) database were performed with
BLAST on the NCBI file server (Altschul et al., 1990). Amino acid alignments were
generated with PILEUP (Feng and Doolittle, 1987). Sequences were analyzed with programs
of the Genetics Computer Group package (Devereux et al., 1984) on a VAX computer.

Clones corresponding to the mouse ortholog of the human ABCR gene were 
isolated from the mouse retina cDNA library and end-sequenced. The chromosomal location 
of the mouse ABCR gene was determined on The Jackson Laboratory (Bar Harbor, ME)
interspecific backcross mapping panel (C57BL/6JEi X SPRET/Ei)F1 X SPRET/Ei (Rowe et
al., 1994) known as Jackson BSS. Mapping was performed by SSCP analysis with the
primers MABCR1F 5' ATC CAT ACC CTT CCC ACT CC, SEQ ID NO: 9, and MABCR1R
5' GCA GCA GAA GAT AAG CAC ACC, SEQ ID NO: 10. The allele pattern of the Abcr
was compared to the 250 other loci mapped previously in the Jackson BSS cross
(http://www.jax.org). 

DNA fragments used as probes were purified on a 1% low-melting temperature
agarose gel. The probe sequences are set forth within the genomic sequence of SEQ ID NO:
1 and Figure 3. DNA was labeled directly in agarose with the Random Primed DNA Labeling
Kit (Boehringer Mannheim, Indianapolis, IN) and hybridized to multiple tissue Northern blot
and a Master blot (Clontech, Palo Alto, CA), according to the manufacturer's instructions.
Each blot contained 2 µg of poly A+ RNA from various human tissues. Total RNA was
isolated from adult rat tissues using the guanidinium thiocyanate method (Chomczynski and
Sacchi, 1987) and resolved by agarose gel electrophoresis in the presence of formaldehyde
(Sambrook et al., 1989). Hybridization with the mouse ABCR probe was performed in 50%
formamide, 5X SSC at 42°C, and filters were washed in 0.1X SSC at 68°C. 

Hybridization of a 3' ABCR cDNA probe to a multiple tissue Northern blot and
a MasterBlot (Clontech, Palo Alto, CA) indicated that the gene was not expressed detectably
in any of the 50 non-retinal fetal and adult tissues examined, consistent with the observation
that all 12 of the ABCR clones in the EST database originated from retinal cDNA libraries.
Furthermore, screening cDNA libraries from both developing mouse eye and adult human
retina with ABCR probes revealed an estimated frequency of ABCR clones of 0.1%-1% of all
cDNA clones in the library. Hybridization of the ABCR probe to a Northern blot containing
total RNA from rat retina and other tissues showed that the expression of this gene is uniquely
retina-specific (Figure 2). The transcript size is estimated to be 8 kb.

Sequence and Exon/Intron Structure of the ABCR cDNA

Several ESTs that were derived from retina cDNA libraries and had high
similarity to the mouse Abc1 gene were used to facilitate the assembly of most of the ABCR
cDNA sequence. Retina cDNA clones were linked by RT-PCR, and repetitive screening of
a human retina cDNA library with 3' and 5' PCR probes together with 5' RACE were used to
characterize the terminal sequences of the gene.

cDNA clones containing ABCR sequences were obtained from a human retina
cDNA library (Nathans et al., 1986) and sequenced fully. Primers were designed from the
sequences of cDNA clones from 5' and 3' regions of the gene and used to link the identified
cDNA clones by RT-PCR with retina QUICK-Clone cDNA (Clontech, Palo Alto, CA) as a
template. PCR products were cloned into pGEM®-T vector (Promega, Madison, WI). Mouse
ABCR cDNA clones were obtained from screening a developing mouse eye cDNA library (H.
Sun, A. Lanahan, and J. Nathans, unpublished). The pGEM®-T Vector is prepared by cutting
pGEM®-5Zf(+) DNA with EcoR V and adding a 3' terminal thymidine to both ends. These
single 3'-T overhangs at the insertion site greatly improve the efficiency of ligation of PCR
products because of the nontemplate-dependent addition of a single deoxyadenosine (A) to
the 3'-ends of PCR products by many thermostable polymerases. The pGEM®-5Zf(+) Vector
contains the origin of replication of the filamentous phage f1 and can be used to produce
ssDNA. The plasmid also contains T7 and SP6 RNA polymerase promoters flanking a
multiple cloning region within the α-peptide coding region for the enzyme β-galactosidase.
Insertional inactivation of the α-peptide allows recombinant clones to be identified directly
by color screening on indicator plates. cDNA clones from various regions of the ABCR gene
were used as probes to screen a human genomic library in Lambda FIX II (#946203,
Stratagene, La Jolla, CA). Overlapping phage clones were mapped by EcoRI and BamHI
digestion. A total of 6.9 kb of the ABCR sequence was assembled (Figure 3), resulting in a
6540 bp (2180 amino acid) open reading frame.
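
The relationship between the open reading frame length and the protein length quoted above is plain codon arithmetic. The following is a minimal Python sketch of that check; the function name is illustrative only, and the text does not state whether the 6540 bp figure includes a stop codon, so the sketch simply divides by three.

```python
def codons_in_orf(orf_length_bp: int) -> int:
    """Return the number of codons in an open reading frame of the given length.

    Illustrative helper: an ORF is read in triplets, so its length in base
    pairs must be a multiple of three.
    """
    if orf_length_bp <= 0 or orf_length_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3 bp")
    return orf_length_bp // 3


# The 6540 bp open reading frame reported in the text.
print(codons_in_orf(6540))
```

With the figures given in the text, 6540 bp corresponds to 2180 codons, which agrees with the stated 2180 amino acids.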

Screening of a bacteriophage lambda human genomic library with cDNA
probes yielded a contig that spans approximately 100 kb and contains the majority of the
ABCR coding region. The exon/intron structure of all fifty-one exons of the gene was
characterized by direct sequencing of genomic and cDNA clones. Intron sizes were estimated
from the sizes of PCR products using primers from adjacent exons with genomic phage clones
as templates.

Primers for the cDNA sequences of the ABCR were designed with the PRIMER
program (Lincoln et al., 1991). Both ABCR cDNA clones and genomic clones served as
templates for sequencing. Sequencing was performed with the Taq Dyedeoxy Terminator
[text illegible in source]. Sequencing reactions were resolved on an ABI [text illegible in
source]. Primers for amplification of individual exons [text illegible in source] 20-50 bp
from the splice site and are set forth in Table 1.



Table 1 Exon/intron Primers for ABCR

PRIMER           SEQUENCE                  SEQ ID NO

ABCR.EXON1:F     ACCCTCTGCTAAGCTCAGAG      12
ABCR.EXON1:R     ACCCCACACTTCCAACCTG       13
ABCR.EXON2:F     AAGTCCTACTGCACACATGG      14
ABCR.EXON2:R     ACACTCCCACCCCAAGATC       15
ABCR.EXON3:F     TTCCCAAAAAGGCCAACTC       16
ABCR.EXON3:R     CACGCACGTGTGCATTCAG       17
ABCR.EXON4:F     GCTATTTCCTTATTAATGAGGC    18
ABCR.EXON4:R     CCAACTCTCCCTGTTCTTTC      19
ABCR.EXON5:F     TGTTTCCAATCGACTCTGGC      20
ABCR.EXON5:R     TTCTTGCCTTTCTCAGGCTGG     21
ABCR.EXON6:F     GTATTCCCAGGTTCTGTGG       22
ABCR.EXON6:R     TACCCCAGGAATCACCTTG       23
ABCR.EXON7:F     AGCATATAGGAGATCAGACTG     24
ABCR.EXON7:R     TGACATAAGTGGGGTAAATGG     25
ABCR.EXON8:F     GAGCATTGGCCTCACAGCAG      26
ABCR.EXON8:R     CCCCAGGTTTGTTTCACC        27
Table 1 Exon/intron Primers for ABCR (continued)

PRIMER           SEQUENCE                  SEQ ID NO

ABCR.EXON9:F     AGACATGTGATGTGGATACAC     28
ABCR.EXON9:R     GTGGGAGGTCCAGGGTACAC      29
ABCR.EXON10:F    AGGGGCAGAAAAGACACAC       30
ABCR.EXON10:R    TAGCGATTAACTCTTTCCTGG     31
ABCR.EXON11:F    CTCTTCAGGGAGCCTTAGC       32
ABCR.EXON11:R    TTCAAGACCACTTGACTTGC      33
ABCR.EXON12:F    TGGGACAGCAGCCTTATC        34
ABCR.EXON12:R    CCAAATGTAATTTCCCACTGAC    35
ABCR.EXON13:F    AATGAGTTCCGAGTCACCCTG     36
ABCR.EXON13:R    CCCATTCGCGTGTCATGG        37
ABCR.EXON14:F    TCCATCTGGGCTTTGTTCTC      38
ABCR.EXON14:R    AATCCAGGCACATGAACAGG      39
ABCR.EXON15:F    AGGCTGGTGGGAGAGAGC        40
ABCR.EXON15:R    AGTGGACCCCCTCAGAGG        41
ABCR.EXON16:F    CTGTTGCATTGGATAAAAGGC     42
ABCR.EXON16:R    GATGAATGGAGAGGGCTGG       43
ABCR.EXON17:F    CTGCGGTAAGGTAGGATAGGG     44
ABCR.EXON17:R    CACACCGTTTACATAGAGGGC     45
ABCR.EXON18:F    CCTCTCCCCTCCTTTCCTG       46
ABCR.EXON18:R    GTCAGTTTCCGTAGGCTTC       47
Table 1 Exon/intron Primers for ABCR (continued)

PRIMER           SEQUENCE                  SEQ ID NO

ABCR.EXON19:F    TGGGGCCATGTAATTAGGC       48
ABCR.EXON19:R    TGGGAAAGAGTAGACAGCCG      49
ABCR.EXON20:F    ACTGAACCTGGTGTGGGG        50
ABCR.EXON20:R    TATCTCTGCCTGTGCCCAG       51
ABCR.EXON21:F    GTAAGATCAGCTGCTGGAAG      52
ABCR.EXON21:R    GAAGCTCTCCTGCACCAAGC      53
ABCR.EXON22:F    AGGTACCCCCACAATGCC        54
ABCR.EXON22:R    TCATTGTGGTTCCAGTACTCAG    55
ABCR.EXON23:F    TTTTTGCAACTATATAGCCAGG    56
ABCR.EXON23:R    AGCCTGTGTGAGTAGCCATG      57
ABCR.EXON24:F    GCATCAGGGCGAGGCTGTC       58
ABCR.EXON24:R    CCCAGCAATACTGGGAGATG      59
ABCR.EXON25:F    GGTAACCTCACAGTCTTCC       60
ABCR.EXON25:R    GGGAACGATGGCTTTTTGC       61
ABCR.EXON26:F    TCCCATTATGAAGCAATACC      62
ABCR.EXON26:R    CCTTAGACTTTCGAGATGG       63
ABCR.EXON27:F    GCTACCAGCCTGGTATTTCATTG   64
ABCR.EXON27:R    GTTATAACCCATGCCTGAAG      65
Table 1 Exon/intron Primers for ABCR (continued)

PRIMER           SEQUENCE                  SEQ ID NO

ABCR.EXON28:F    TGCACGCGCACGTGTGAC        66
ABCR.EXON28:R    TGAAGGTCCCAGTGAAGTGGG     67
ABCR.EXON29:F    CAGCAGCTATCCAGTAAAGG      68
ABCR.EXON29:R    AACGCCTGCCATCTTGAAC       69
ABCR.EXON30:F    GTTGGGCACAATTCTTATGC      70
ABCR.EXON30:R    GTTGTTTGGAGGTCAGGTAC      71
ABCR.EXON31:F    AACATCACCCAGCTGTTCCAG     72
ABCR.EXON31:R    ACTCAGGAGATACCAGGGAC      73
ABCR.EXON32:F    GGAAGACAACAAGCAGTTTCAC    74
ABCR.EXON32:R    ATCTACTGCCCTGATCATAC      75
ABCR.EXON33:F    AAGACTGAGACTTCAGTCTTC     76
ABCR.EXON33:R    GGTGTGCCTTTTAAAAGTGTGC    77
ABCR.EXON34:F    TTCATGTTTCCCTACAAAACCC    78
ABCR.EXON34:R    CATGAGAGTTTCTCATTCATGG    79
ABCR.EXON35:F    TGTTTACATGGTTTTTAGGGCC    80
ABCR.EXON35:R    TTCAGCAGGAGGAGGGATG       81
ABCR.EXON36:F    CCTTTCCTTCACTGATTTCTGC    82
ABCR.EXON36:R    AATCAGCACTTCGCGGTG        83
Table 1 Exon/intron Primers for ABCR (continued)

PRIMER           SEQUENCE                  SEQ ID NO

ABCR.EXON37:F    TGTAAGGCCTTCCCAAAGC       84
ABCR.EXON37:R    TGGTCCTTCAGCGCACACAC      85
ABCR.EXON38:F    CATTTTGCAGAGCTGGCAGC      86
ABCR.EXON38:R    CTTCTGTCAGGAGATGATCC      87
ABCR.EXON39:F    GGAGTGCATTATATCCAGACG     88
ABCR.EXON39:R    CCTGGCTCTGCTTGACCAAC      89
ABCR.EXON40:F    TGCTGTCCTGTGAGAGCATC      90
ABCR.EXON40:R    GTAACCCTCCCAGCTTTGG       91
ABCR.EXON41:F    CAGTTCCCACATAAGGCCTG      92
ABCR.EXON41:R    CAGTTCTGGATGCCCTGAG       93
ABCR.EXON42:F    GAAGAGAGGTCCCATGGAAAGG    94
ABCR.EXON42:R    GCTTGCATAAGCATATCAATTG    95
ABCR.EXON43:F    CTCCTAAACCATCCTTTGCTC     96
ABCR.EXON43:R    AGGCAGGCACAAGAGCTG        97
ABCR.EXON44:F    CTTACCCTGGGGCCTGAC        98
ABCR.EXON44:R    CTCAGAGCCACCCTACTATAG     99
ABCR.EXON45:F    GAAGCTTCTCCAGCCCTAGC      100
ABCR.EXON45:R    TGCACTCTCATGAAACAGGC      101
Table 1 Exon/intron Primers for ABCR (continued)

PRIMER           SEQUENCE                  SEQ ID NO

ABCR.EXON46:F    GTTTGGGGTGTTTGCTTGTC      102
ABCR.EXON46:R    ACCTCTTTCCCCAACCCAGAG     103
ABCR.EXON47:F    GAAGCAGTAATCAGAAGGGC      104
ABCR.EXON47:R    GCCTCACATTCTTCCATGCTG     105
ABCR.EXON48:F    TCACATCCCACAGGCAAGAG      106
ABCR.EXON48:R    TTCCAAGTGTCAATGGAGAAC     107
ABCR.EXON49:F    ATTACCTTAGGCCCAACCAC      108
ABCR.EXON49:R    ACACTGGGTGTTCTGGACC       109
ABCR.EXON50:F    GTGTAGGGTGGTGTTTTCC       110
ABCR.EXON50:R    AAGCCCAGTGAACCAGCTGG      111
ABCR.EXON51:F    TCAGCTGAGTGCCCTTCAG       112
ABCR.EXON51:R    AGGTGAGCAAGTCAGTTTCGG     113



In Table 1, "F" indicates forward, i.e., 5' to 3', "R" indicates reverse, i.e., 3' to
5'. PCR conditions were 95°C for 8 minutes; 5 cycles at 62°C for 20 seconds, 72°C for 30
seconds; 35 cycles at 60°C for 20 seconds, 72°C for 30 seconds; 72°C for 5 minutes (except
that * was performed at 94°C for 5 minutes); 5 cycles at 94°C for 40 seconds; 60°C for 30
seconds; 72°C for 20 seconds; 35 cycles at 94°C for 40 seconds; 56°C for 30 seconds; 72°C
for 20 seconds, and 72°C for 5 minutes.
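
As a rough sanity check on the primers of Table 1, basic properties such as length, GC content, and an approximate melting temperature can be computed directly from the sequences. The Python sketch below uses the Wallace rule of thumb (2°C per A/T and 4°C per G/C), which is an assumption introduced here for illustration; the text does not say how the primers were evaluated beyond the use of the PRIMER program, and the function names are hypothetical.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in a primer (spaces are ignored)."""
    seq = primer.replace(" ", "").upper()
    return sum(base in "GC" for base in seq) / len(seq)


def wallace_tm(primer: str) -> int:
    """Approximate melting temperature in deg C by the Wallace rule.

    Tm = 2 * (A + T) + 4 * (G + C); a rule of thumb for short
    oligonucleotides, not the method used in the source document.
    """
    seq = primer.replace(" ", "").upper()
    at = sum(base in "AT" for base in seq)
    gc = sum(base in "GC" for base in seq)
    return 2 * at + 4 * gc


# ABCR.EXON1:F from Table 1 (SEQ ID NO: 12).
exon1_forward = "ACCCTCTGCTAAGCTCAGAG"
print(len(exon1_forward), gc_fraction(exon1_forward), wallace_tm(exon1_forward))
```

Estimates of this kind are only a starting point; the annealing temperatures actually used are those stated in the PCR conditions above.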

Amplification of exons was performed with AmpliTaq Gold polymerase in a
25 µl volume in 1X PCR buffer supplied by the manufacturer (Perkin Elmer, Foster City,
CA). Samples were heated to 95°C for 10 minutes and amplified for 35-40 cycles at 96°C
for 20 seconds; 58°C for 30 seconds; and 72°C for 30 seconds. PCR products were analyzed
on 1-1.5% agarose gels and in some cases digested with an appropriate restriction enzyme
to verify their sequence. Primer sequences and specific reaction conditions are set forth in
Table 1. The sequence of the ABCR cDNA has been deposited with GenBank under accession
# U88667.

Homology to ABC Superfamily Members

A BLAST search revealed that ABCR is most closely related to the previously
characterized mouse Abc1 and Abc2 genes (Luciani et al., 1994) and to another human gene
(ABCC) which maps to chromosome 16p13.3 (Klugbauer and Hofmann, 1996). These genes,
together with ABCR and a gene from C. elegans (GenBank #Z29117), form a subfamily of
genes specific to multicellular organisms and not represented in yeast (Michaelis and
Berkower, 1995; Allikmets et al., 1996). Alignment of the cDNA sequence of ABCR with the
Abc1, Abc2, and ABCC genes revealed, as expected, the highest degree of homology within
the ATP-binding cassettes. The predicted amino acid identity of the ABCR gene to mouse
Abc1 was 70% within the ATP-binding domains; even within hydrophobic membrane-spanning
segments, homology ranged between 55 and 85% (Figure 4). The putative ABCR
initiator methionine shown in Figures 3 and 4 corresponds to a methionine codon at the 5' end
of Abc1 (Luciani et al., 1994).

ABCR shows the composition of a typical full-length ABC transporter that
consists of two transmembrane domains (TM), each with six membrane spanning hydrophobic
segments, as predicted by a hydropathy plot (data not shown), and two highly conserved
ATP-binding domains (Figures 3 and 4). In addition, the HH1 hydrophobic domain, located
between the first ATP and second TM domain and specific to this subfamily (Luciani et al.,
1994), showed a predicted 57% amino acid identity (24 of 42 amino acids) with the mouse
Abc1 gene.
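
The identity figures quoted above are ratios of matching positions to aligned positions. The Python sketch below shows one simple way to compute such a percentage from two pre-aligned sequences; the function name and the convention of counting gap positions in the denominator are assumptions made for illustration, since the text does not describe how the alignment was scored.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned positions carrying the same residue.

    Both inputs must already be aligned to equal length; '-' marks a gap
    and never counts as a match. Gap columns stay in the denominator,
    which is one of several possible conventions.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b and a != "-" for a, b in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)


# The HH1 domain figure from the text: 24 identical residues out of 42.
hh1_identity = 100.0 * 24 / 42
print(round(hh1_identity))
```

Rounding 24/42 to the nearest whole percent reproduces the 57% stated for the HH1 domain.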

To characterize the mouse ortholog of ABCR, cDNA clones from a developing
mouse eye library were isolated. A partial sequence of the mouse cDNA was utilized to
design PCR primers to map the mouse Abcr gene in an interspecific backcross mapping panel
(Jackson BSS). The allele pattern of Abcr was compared to 2450 other loci mapped
previously in the Jackson BSS cross; linkage was found to the distal end of chromosome 3
(Figure 5). No recombinants were observed between Abcr and D13Mit13. This region of the
mouse genome is syntenic with human chromosome 1p13-p21. Thus far, no eye disease
phenotype has been mapped to this region of mouse chromosome 3.

Compound Heterozygous and Homozygous Mutations in STGD Patients 

One hundred forty-five North American and three Saudi Arabian families with
STGD/FFM were examined. Among these, at least four were consanguineous families in
which the parents were first cousins. Entry criteria for the characterization of the clinical and
angiographic diagnosis of Stargardt disease, ascertainment of the families, and methodology
for their collection, including the consanguineous families from Saudi Arabia, were as
provided in Anderson et al., 1995; and Anderson, 1996.

Mutational analysis of the ABCR gene was pursued in the above identified one
hundred forty-eight STGD families previously ascertained by strict definitional criteria and
shown to be linked to chromosome 1p (Anderson et al., 1995; Anderson, 1996). To date, all
51 exons have been used for mutation analysis.

Mutations were detected by a combined SSCP (Orita et al., 1989) and
heteroduplex analysis (White et al., 1992) under optimized conditions (Glavač and Dean,
1993). Genomic DNA samples (50 ng) were amplified with AmpliTaq Gold polymerase in
1X PCR buffer supplied by the manufacturer (Perkin Elmer, Foster City, CA) containing
[α-32P] dCTP. Samples were heated to 95°C for 10 minutes and amplified for 35-40 cycles at
96°C for 20 seconds; 58°C for 30 seconds; and 72°C for 30 seconds. Products were diluted
in 1:3 stop solution, denatured at 95°C for 5 minutes, chilled in ice for 5 minutes, and loaded
on gels. Gel formulations include 6% acrylamide:Bis (2.6% cross-linking), 10% glycerol at
room temperature, 12W; and 10% acrylamide:Bis (1.5% cross-linking), at 4°C, 70W. Gels
were run for 2-16 hours (3000 Vh/100 bp), dried, and exposed to X-ray film for 2-12 hours.
Some exons were analyzed by SSCP with MDE acrylamide (FMC Bioproducts, Rockland,
ME) with and without 10% glycerol for 18 hours, 4 watts at room temperature with
α-32P-dCTP labeled DNA. Heteroduplexes were identified from the double-stranded DNA at the
bottom of the gels, and SSCPs were identified from the single-stranded region. Samples
showing variation were compared with other family members to assess segregation of the
alleles and with at least 40 unrelated control samples, from either Caucasian or Saudi Arabian
populations, to distinguish mutations from polymorphisms unrelated to STGD. PCR products
with SSCP or heteroduplex variants were obtained in a 25 µl volume, separated on a 1%
agarose gel, and isolated by a DNA purification kit (PGC Scientific, Frederick, MD).
Sequencing was performed on an ABI sequencer with both dye primer and dye terminator
chemistry.

Some mutations were identified with a heteroduplex analysis protocol (Roa et
al., 1993). Equimolar amounts of control and patient PCR products were mixed in 0.2 ml
tubes. Two volumes of PCR product from a normal individual served as a negative control,
and MPZ exon 3 from patient BAB731 as a positive control (Roa et al., 1996). Samples were
denatured at 95°C for 2 minutes and cooled to 35°C at a rate of 1°C/minute. Samples were
loaded onto 1.0 mm thick, 40 cm MDE gels (FMC Bioproducts, Rockland, ME),
electrophoresed at 600-800 V for 15-20 hours, and visualized with ethidium bromide.
Samples showing a variant band were reamplified with biotinylated forward and reverse
primers and immobilized on streptavidin-conjugated beads (Warner et al., 1996). The
resulting single strands were sequenced by the dideoxy-sequencing method with Sequenase
2.0 (Amersham, Arlington Heights, IL).

A total of seventy-five mutations were identified, the majority representing
missense mutations in conserved amino acid positions. However, several insertions and
deletions representing frameshifts were also found (Table 2). The sequences of two mutations
are shown in Figures 6A and 6B. Two missense alterations (D847H, R943Q) were found in
at least one control individual, suggesting that they are neutral polymorphisms. The
remaining mutations were found in patients having macular degeneration and were not found
in at least 220 unrelated normal controls (440 chromosomes), consistent with the
interpretation that these alterations represent disease-causing mutations, not polymorphisms.
One of the mutations, 5892+1G->T, occurs in family AR144 in which one of the affected
children is recombinant for the flanking marker D1S236 (Anderson et al., 1995). This
mutation, however, is present in the father as well as in both affected children. Therefore, the
ABCR gene is non-recombinant with respect to the Stargardt disease locus.
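
The control comparison above can be put in numerical terms: 220 diploid controls contribute 440 chromosomes, and observing a variant on none of them bounds how common that variant could plausibly be in the control population. The Python sketch below computes a one-sided upper confidence bound under a simple binomial model; the model, the 95% level, and the function name are assumptions added for illustration and are not part of the analysis described in the text.

```python
def allele_freq_upper_bound(n_chromosomes: int, confidence: float = 0.95) -> float:
    """Upper bound on allele frequency when 0 of n chromosomes carry a variant.

    Assumes independent chromosomes (binomial sampling). Solves
    (1 - p) ** n = 1 - confidence for p.
    """
    if n_chromosomes <= 0:
        raise ValueError("n_chromosomes must be positive")
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must lie strictly between 0 and 1")
    return 1.0 - (1.0 - confidence) ** (1.0 / n_chromosomes)


controls = 220
chromosomes = 2 * controls  # two copies of the gene per control individual
bound = allele_freq_upper_bound(chromosomes)
print(chromosomes, f"{bound:.4f}")
```

Under these assumptions, a variant absent from all 440 control chromosomes is unlikely to have a population frequency much above roughly 0.7%, which is in line with the interpretation given in the text.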

The mutations are scattered throughout the coding sequence of the ABCR gene
(see Table 2 and Figure 3), although clustering within the conserved regions of the
ATP-binding domains is noticeable. Homozygous mutations were detected in three likely
consanguineous families, two Saudi Arabian and one North American (Anderson et al., 1995),
in each of which only the affected individuals inherited the identical disease allele (Table 2;
Figure 6C). Forty-two compound heterozygous families were identified in which the two
disease alleles were transmitted from different parents to only the affected offspring (Table
2).

Table 2. Mutations in the ABCR gene in STGD Families

Nucleotide     Amino Acid      # Families   Exon

0223T->G       C75G            1            3
0634C->T       R212C           1            6
0664del13      fs              1            6
0746A->G       D249G           1            6
1018T->G       Y340D           2            8
1411G->A       E471K           1            11
1569T->G       D523E           1            12
1715G->A       R572Q           2            12
1715G->C       R572P           1            12
1804C->T       R602W           1            13
1822T->A       F608I           1            13
1917C->A       Y639X           1            13
2453G->A       G818E           1            16
2461T->A       W821R           1            16
2536G->C       D846H           1            16
2588G->C       G863A           11           17
2791G->A       V931M           1            19
2827C->T       R943W           1            19
2884delC       fs              1            19
2894A->G       N965S           3            19
3083C->T       A1028V          14           21
3211delGT      fs              1            22
3212C->T       S1071L          1            22
3215T->C       V1072A          1            22
3259G->A       E1087K          1            22
3322C->T       R1108C          6            22
3364G->A       E1122K          1            23
3385G->T       R1129C          1            23
3386G->T       R1129L          1            23
3602T->G       L1201R          1            24
3610G->A       D1204N          1            25
4139C->T       P1380L          2            28
4195G->A       E1399K          1            28
4222T->C       W1408R          3            28
4232insTATG    fs              1            28
4253+5G->T     splice          1            28
4297G->A       V1433I          1            29
4316G->A       G1439D          1            29
Table 2. Mutations in the ABCR gene in STGD Families (continued)

Nucleotide     Amino Acid      # Families   Exon

4319T->C       F1440S          1            29
4346G->A       W1449X          1            29
4462T->C       C1488R          1            30
4469G->A       C1490Y          1            31
4577C->T       T1526M          6            32
4594G->A       D1532N          2            32
4947delC       fs              1            36
5041del15      VVAIC1681del    1            37
5196+2T->C     splice          1            37
5281del9       PAL1761del      1            38
5459G->C       R1820P          1            39
5512C->T       H1838Y          1            40
5527C->T       R1843W          1            40
5585+1G->A     splice          1            41
5657G->A       G1886E          1            41
5693G->A       R1898H          4            41
5714+5G->A     splice          8            41
5882G->A       G1961E          16           43
5898+1G->A     splice          3            43
5908C->T       L1970F          1            44
5929G->A       G1977S          1            44
6005+1G->T     splice          1            44
6079C->T       L2027F          11           45
6088C->T       R2030X          1            45
6089G->A       R2030Q          1            45
6112C->T       R2038W          1            45
6148G->C       V2050L          2            46
6166A->T       K2056X          1            46
6229C->T       R2077W          1            46
6286G->A       E2096K          1            47
6316C->T       R2106C          1            47
6391G->A       E2131K          1            48
6415C->T       R2139W          1            48
6445C->T       R2149X          1            48
6543del36      1181del12       1            49
6709delG       fs              1            49




Mutations are named according to standard nomenclature. The column headed
"Exon" denotes which of the 51 exons of ABCR contains the mutation. The column headed
"# Families" denotes the number of Stargardt families which displayed the mutation. The
column headed "Nucleotide" gives the base number starting from the A in the initiator ATG,
followed by the wild type sequence and an arrow indicating the base it is changed to; del
indicates a deletion of selected bases at the given position in the ABCR gene; ins indicates an
insertion of selected bases at the given position; splice donor site mutations are indicated by
the number of the last base of the given exon, followed by a plus sign and the number of
bases into the intron where the mutation occurs. The column headed "Amino Acid" denotes
the amino acid change a given mutation causes; fs indicates a frameshift mutation leading to
a truncated protein; splice indicates a splice donor site mutation; del indicates an in-frame
deletion of the given amino acids.

Mutations are named according to standard nomenclature. Exon numbering is
according to the nucleotide position starting from the A in the initiator ATG.
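
The naming convention described for the "Nucleotide" column is regular enough to be parsed mechanically. The Python sketch below is one possible reading of that convention, written for the "->" arrow notation used in Table 2; the `Mutation` record, the regular expression, and the function names are hypothetical and are not part of the original disclosure. It also shows how a coding position counted from the A of the initiator ATG maps onto a codon number.

```python
import re
from typing import NamedTuple


class Mutation(NamedTuple):
    position: int       # base number counted from the A of the initiator ATG
    intron_offset: int  # bases into the intron for splice donor site changes, else 0
    kind: str           # "substitution", "deletion", or "insertion"
    detail: str         # e.g. "G>C", deleted bases or their count, or inserted bases


_PATTERN = re.compile(
    r"^(?P<pos>\d+)"
    r"(?:\+(?P<offset>\d+))?"
    r"(?:(?P<ref>[ACGT])->(?P<alt>[ACGT])"
    r"|del(?P<deleted>[ACGT]+|\d+)"
    r"|ins(?P<inserted>[ACGT]+))$"
)


def parse_mutation(name: str) -> Mutation:
    """Parse a Table 2 style nucleotide name such as '2588G->C' or '0664del13'."""
    match = _PATTERN.match(name.replace(" ", ""))
    if match is None:
        raise ValueError(f"unrecognized mutation name: {name!r}")
    position = int(match["pos"])
    offset = int(match["offset"] or 0)
    if match["ref"]:
        return Mutation(position, offset, "substitution",
                        f"{match['ref']}>{match['alt']}")
    if match["deleted"]:
        return Mutation(position, offset, "deletion", match["deleted"])
    return Mutation(position, offset, "insertion", match["inserted"])


def codon_number(position: int) -> int:
    """1-based codon containing a coding base numbered from the A of the ATG."""
    return (position - 1) // 3 + 1


example = parse_mutation("2588G->C")
print(example, codon_number(example.position))
```

For example, base 2588 falls in codon 863, which matches the G863A entry listed against 2588G->C in Table 2; the same arithmetic does not apply to the intronic offset of splice donor site entries.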

In Situ Hybridization 

STGD is characterized histologically by a massive accumulation of a
lipofuscin-like substance in the retinal pigment epithelium (RPE). This characteristic has led
to the suggestion that STGD represents an RPE storage disorder (Blacharski et al., 1988). It
was therefore of interest that ABCR transcripts were found to be abundant in the retina. To
identify the site(s) of ABCR gene expression at higher resolution and to determine whether the
gene is also expressed in the RPE, the distribution of ABCR transcripts was visualized by in
situ hybridization to mouse, rat, bovine, and macaque ocular tissues.

In situ hybridization with digoxigenin-labeled riboprobes was performed as
described by Schaeren-Wiemers and Gerfin-Moser, 1993. For mouse and rat, unfixed whole
eyes were frozen and sectioned; macaque retinas were obtained following cardiac perfusion
with paraformaldehyde as described (Zhou et al., 1996). An extra incubation of 30 min in 1%
Triton X-100, 1X PBS was applied to the fixed monkey retina sections immediately after the
acetylation step. The templates for probe synthesis were: (1) a 1.6 kb fragment encompassing
the 3' end of the mouse Abcr coding region, (2) a full length cDNA clone encoding the mouse
blue cone pigment (Chiu et al., 1994), and (3) a macaque rhodopsin coding region segment
encoding residues 133 to 254 (Nickells, R. W., Burgoyne, C.F., Quigley, H.A., and Zack, D.J.
(1995)).

This analysis showed that ABCR transcripts are present exclusively within
photoreceptor cells (Figure 7). ABCR transcripts are localized principally to the rod inner
segments, a distribution that closely matches that of rhodopsin gene transcripts. Interestingly,
ABCR hybridization was not observed at detectable levels in cone photoreceptors, as judged
by comparisons with the hybridization patterns obtained with a blue cone pigment probe
(compare Figure 7A with Figure 7D, Figure 7E with Figure 7F, and Figure 7G with Figure 7H).
Because melanin granules might obscure a weak hybridization signal in the RPE of a
pigmented animal, the distribution of ABCR transcripts was also examined in both albino rats
and albino mice. In these experiments, the ABCR hybridization signal was seen in the
photoreceptor inner segments and was unequivocally absent from the RPE (Figure 7E). Given
that ABCR transcripts in each of these mammals, including a primate, are
photoreceptor-specific, it is highly likely that the distribution of ABCR transcripts conforms
to this pattern as well in the human retina.

The disclosures of each patent, patent application and publication cited or 
described in this document are hereby incorporated herein by reference, in their entirety. 

Various modifications of the invention in addition to those shown and
described herein will be apparent to those skilled in the art from the foregoing description.
Such modifications are also intended to fall within the scope of the appended claims.
What is claimed is:

1. An isolated nucleic acid sequence encoding retina-specific ATP binding
cassette transporter.

2. An isolated nucleic acid sequence selected from the group consisting
of SEQ ID NO: 1, or a fragment thereof having substantially the same activity.

3. An isolated nucleic acid sequence selected from the group consisting
of SEQ ID NOS: 2 or 5, or a fragment thereof having substantially the same activity.

4. An isolated amino acid sequence selected from the group consisting of
SEQ ID NO: 3 or 6, or a fragment thereof having substantially the same activity.

5. An isolated amino acid sequence of Figure 3, or a fragment thereof
having substantially the same activity.

6. A vector comprising a nucleic acid sequence encoding retina-specific
ATP binding cassette transporter.

7. A vector comprising a nucleic acid sequence selected from the group
consisting of SEQ ID NO: 1, or a fragment thereof having substantially the same activity.

8. A vector comprising a nucleic acid sequence selected from the group
consisting of SEQ ID NOS: 2 or 5, or a fragment thereof having substantially the same
activity.

9. A vector comprising a nucleic acid sequence encoding an amino acid
sequence selected from the group consisting of SEQ ID NOS: 3 or 6.

10. A vector comprising a nucleic acid sequence encoding the amino acid
sequence of Figure 3.







WO 98/37764 



PCT/US98/03895 




11. A host cell capable of expressing a nucleic acid sequence encoding a
retina-specific ATP binding cassette transporter.

12. A host cell capable of expressing a nucleic acid sequence of SEQ ID
NO: 1.

13. A host cell capable of expressing a nucleic acid sequence selected from
the group consisting of SEQ ID NOS: 2 or 5.

14. A host cell capable of expressing a nucleic acid sequence encoding an
amino acid sequence selected from the group consisting of SEQ ID NOS: 3 or 6.

15. A host cell capable of expressing a nucleic acid sequence encoding the
amino acid sequence of Figure 3.

16. A cell culture capable of expressing a retina-specific ATP binding
cassette transporter.

17. A cell culture capable of expressing a nucleic acid sequence of SEQ ID
NO: 1.

18. A cell culture capable of expressing a nucleic acid sequence selected
from the group consisting of SEQ ID NOS: 2 or 5.

19. A cell culture capable of expressing a nucleic acid sequence encoding
an amino acid sequence selected from the group consisting of SEQ ID NOS: 3 or 6.

20. A cell culture of claim 19 obtained by transforming a cell with an
expression vector comprising a nucleic acid sequence selected from the group consisting of
SEQ ID NOS: 2 or 5.
21. A cell culture capable of expressing a nucleic acid sequence encoding
an amino acid sequence selected from the group consisting of SEQ ID NOS: 3 or 6.

22. A protein preparation comprising an amino acid sequence for
retina-specific ATP binding cassette transporter.

23. A protein preparation comprising an amino acid sequence encoded by
a sequence of SEQ ID NO: 1.

24. A protein preparation comprising an amino acid sequence encoded by
a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 2 or 5.

25. A protein preparation comprising an amino acid sequence selected from
the group consisting of SEQ ID NOS: 3 or 6.

26. A protein preparation comprising an amino acid sequence of Figure 3.

27. A composition comprising an effective amount of a sequence selected
from the group consisting of SEQ ID NOS: 2 or 5 or a fragment thereof having substantially
similar activity, and a pharmaceutically acceptable carrier.

28. A composition comprising an effective amount of an antisense
sequence to a sequence selected from the group consisting of SEQ ID NOS: 2 or 5 or a
fragment thereof which fragment has substantially similar activity, and a pharmaceutically
acceptable carrier.

29. A method of screening for an agent that alters retina-specific ATP
binding cassette transporter comprising combining purified retina-specific ATP binding
cassette transporter and at least one agent suspected of altering retina-specific ATP binding
cassette transporter and observing an alteration in said purified retina-specific ATP binding
cassette transporter.

30. The method of claim 29 wherein said alteration is activation of said
purified retina-specific ATP binding cassette transporter observed by an inhibition of a
characteristic associated with macular degeneration selected from the group consisting of
inhibition of central visual impairment, inhibition of progressive bilateral atrophy of the
macular retinal pigment epithelium, inhibition of progressive bilateral atrophy of the
neuroepithelium, inhibition of macula flecks, inhibition of midretinal periphery flecks, and
inhibition of retina-specific ATP binding cassette transporter transcripts in photoreceptor cells.

31. The method of claim 30 wherein said macular degeneration is selected
from the group consisting of Stargardt Disease, Fundus Flavimaculatus, and age-related
macular degeneration.

32. A method of claim 29 wherein said alteration is an inhibition of said
purified retina-specific ATP binding cassette transporter observed by a characteristic
associated with macular degeneration selected from the group consisting of central visual
impairment, bilateral atrophy of the macular retinal pigment epithelium, bilateral atrophy of
the neuroepithelium, macula flecks, midretinal periphery flecks, and retina-specific ATP
binding cassette transporter transcripts in photoreceptor cells.

33. A method of screening for an agent that inhibits macular degeneration
comprising combining purified retina-specific ATP binding cassette transporter from a patient
suspected of having macular degeneration and at least one agent suspected of activating
retina-specific ATP binding cassette transporter and observing an activation in said purified
retina-specific ATP binding cassette transporter.

34. A method of screening for an agent that activates macular degeneration
comprising combining a purified wild-type retina-specific ATP binding cassette transporter
and at least one agent suspected of activating macular degeneration and observing an
inhibition in said purified wild-type retina-specific ATP binding cassette transporter.
35. A transgenic non-human mammal comprising a recombinant sequence
encoding a retina-specific ATP binding cassette transporter introduced into said mammal, or
an ancestor of said mammal.

36. The mammal of claim 35 wherein said sequence encoding said
retina-specific ATP binding cassette transporter is selected from the group consisting of SEQ ID
NOS: 1, 2, and 5.

37. A transgenic non-human mammal comprising a suppressed
retina-specific ATP binding cassette transporter gene.

38. A transgenic non-human mammal comprising a recombinant wild-type
sequence encoding retina-specific ATP binding cassette transporter.

39. The transgenic non-human mammal of claim 35 wherein said
retina-specific ATP binding cassette transporter sequence is selected from the group consisting of
SEQ ID NOS: 3 and 6.

40. A diagnostic kit for detecting macular degeneration comprising in one
or more containers a pair of primers, wherein one primer within said pair is complementary
to a region of the retina-specific ATP binding cassette receptor, a probe specific to the
amplified product, and a means for visualizing amplified DNA, and optionally including one
or more size markers, and positive and negative controls.

41. The diagnostic kit of claim 40 wherein said primer is selected from the
group consisting of SEQ ID NOS: 12-113.

42. The diagnostic kit of claim 40 wherein said primer is complementary
to a region flanking an exon of retina-specific ATP binding cassette receptor genomic DNA
sequence.

43. The diagnostic kit of claim 40 wherein said means for visualizing
amplified DNA is selected from the group consisting of fluorescent stain, 32P, and biotin.

44. A method of detecting macular degeneration comprising:

obtaining a sample comprising patient nucleic acids from a patient tissue
sample;

amplifying retina-specific ATP binding cassette receptor specific nucleic acids
from said patient nucleic acids to produce a test fragment;

obtaining a sample comprising control nucleic acids from a control tissue
sample;

amplifying control nucleic acids encoding wild-type retina-specific ATP
binding cassette receptor to produce a control fragment;

comparing the test fragment with the control fragment to detect the presence
of a sequence difference in the test fragment, wherein a difference in said test fragment
indicates macular degeneration.
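
The comparing step of claim 44 can be illustrated with a minimal sketch. It assumes the test and control fragments have already been sequenced and aligned to the same length, so it reports substitutions only. The function name `sequence_differences`, the two short fragments, and the starting coordinate are hypothetical values chosen for illustration and are not taken from the specification.

```python
def sequence_differences(control: str, test: str, start: int = 1):
    """Return (position, control_base, test_base) for each mismatch.

    Assumes the two fragments are aligned and of equal length, so only
    substitutions are reported (insertions and deletions are not handled).
    """
    if len(control) != len(test):
        raise ValueError("fragments must be aligned to the same length")
    return [
        (start + i, c, t)
        for i, (c, t) in enumerate(zip(control.upper(), test.upper()))
        if c != t
    ]


# Hypothetical fragments for illustration only.
control_fragment = "GGCGGGTTTGCC"
test_fragment = "GGCGGGATTGCC"

# `start` is the assumed coordinate of the first base of the fragment.
diffs = sequence_differences(control_fragment, test_fragment, start=1816)
for position, control_base, test_base in diffs:
    print(f"{position}{control_base}-{test_base}")
```

Each reported difference is printed in the position, wild-type base, variant base form used for the sequence differences recited in the claims.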

45. The method of claim 44 wherein a sequence difference is selected from
the group consisting of a missense mutation, an intragenic deletion, intragenic insertion, a
splice donor site mutation, and a frameshift.



46. The method of claim 44 wherein a sequence difference is a missense
mutation.

47. The method of claim 44 wherein said amplification step comprises
performing the polymerase chain reaction.

48. The method of claim 47 wherein the polymerase chain reaction 
comprises using a pair of primers, wherein one primer within said pair is selected from the 
group consisting of SEQ ID NOS: 12-113. 

49. The method of claim 44 wherein said tissue sample is selected from the
group consisting of blood, skin, serum, saliva, sputum, mucus, bone marrow, urine, lymph, 
a tear, chorion, and amniotic fluid. 

50. The method of claim 44 wherein said sequence difference is selected
from the group consisting of 0223T-G, 0634C-T, 0746A-G, 1018T-G, 1411G-A,
1569T-G, 1715G-A, 1715G-C, 1804C-T, 1822T-A, 1917C-A, 2453G-A, 2461T-A,
2536G-C, 2588G-C, 2791G-A, 2827C-T, 2894A-G, 3083C-T, 3212C-T, 3215T-C,
3259G-A, 3322C-T, 3364G-A, 3385G-T, 3386G-T, 3602T-G, 3610G-A, 4139C-T,
4195G-A, 4222T-C, 4297G-A, 4316G-A, 4319T-C, 4346G-A, 4462T-C, 4469G-A,
4577C-T, 4594G-A, 5041del15, 5281del9, 5459G-C, 5512C-T, 5527C-T, 5657G-A,
5693G-A, 5882G-A, 5908C-T, 5929G-A, 6079C-T, 6088C-T, 6089G-A, 6112C-T,
6148G-C, 6166A-T, 6229C-T, 6286G-A, 6316C-T, 6391G-A, 6415C-T, 6445C-T, and
6543del36.

51. The method of claim 44 further wherein said sequence difference
results in an amino acid sequence difference selected from the group consisting of C75G,
R212C, D249G, Y340D, E471K, D523E, R572Q, R572P, R602W, F608I, Y639X, G818E,
W821R, D846H, G863A, V931M, R943W, N965S, A1028V, S1071L, V1072A, E1087K,
R1108C, E1122K, R1129C, R1129L, L1201R, D1204N, P1380L, E1399K, W1408R,
V1433I, G1439D, F1440S, W1449X, C1488R, C1490Y, T1526M, D1532N, VVAIC1681del,
PAL1761del, R1820P, H1838Y, R1843W, G1886E, R1898H, G1961E, L1970F, G1977S,
L2027F, R2030X, R2030Q, R2038W, V2050L, K2056X, R2077W, E2096K, R2106C,
E2131K, R2139W, R2149X, 1181del12, 0664del13, 2884delC, 4232insTATG, 4947delC,
6709delG, 4253+5G-T, 5196+2T-C, 5585+1G-A, 5714+5G-A, 5898+1G-A, and
6005+1G-T.
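
The relationship between a nucleotide designation such as 1822T-A and an amino acid designation such as F608I can be sketched as follows. This is a minimal illustration and not part of the claimed method. It assumes that nucleotide numbering begins at the A of the ATG start codon, so that codon n spans nucleotides 3n-2 through 3n. The wild-type codons TTT and TAC are read from the Figure 3 sequence under that assumption, and the function name `protein_change` is hypothetical.

```python
import re

# Standard genetic code, bases ordered T, C, A, G. "X" marks a stop codon,
# matching the X used in designations such as Y639X.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYYXXCCXWLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def protein_change(wild_type_codon: str, variant: str) -> str:
    """Translate a substitution such as '1822T-A' into a form such as 'F608I'.

    Assumes numbering starts at the A of the ATG start codon and that
    `wild_type_codon` is the codon containing the substituted position.
    """
    match = re.fullmatch(r"(\d+)([ACGT])-([ACGT])", variant)
    if match is None:
        raise ValueError(f"unsupported variant format: {variant}")
    position, ref, alt = int(match.group(1)), match.group(2), match.group(3)
    codon_number = (position - 1) // 3 + 1   # 1-based codon index
    offset = (position - 1) % 3              # 0-based position within the codon
    if wild_type_codon[offset] != ref:
        raise ValueError("reference base does not match the wild-type codon")
    mutated = wild_type_codon[:offset] + alt + wild_type_codon[offset + 1:]
    return f"{CODON_TABLE[wild_type_codon]}{codon_number}{CODON_TABLE[mutated]}"


print(protein_change("TTT", "1822T-A"))  # missense: F608I
print(protein_change("TAC", "1917C-A"))  # nonsense: Y639X
```

Deletions, insertions, and splice-site designations such as 5041del15 or 5196+2T-C fall outside this sketch, which handles single-base substitutions only.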

52. The method of claim 44 wherein said sequence difference results in a
frame shift in the amino acid sequence.

53. The method of claim 44 wherein said sequence difference results in a
splice site in the amino acid sequence. 

54. A sequence having a sequence of SEQ ID NOS: 12, 13, 14, 15, 16,
17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40,
41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65,
66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90,
91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111,
112, or 113.

55. A sequence encoding SEQ ID NO: 2 having a mutation selected from
the group consisting of 0223T-G, 0634C-T, 0746A-G, 1018T-G, 1411G-A, 1569T-G,
1715G-A, 1715G-C, 1804C-T, 1822T-A, 1917C-A, 2453G-A, 2461T-A, 2536G-C,
2588G-C, 2791G-A, 2827C-T, 2894A-G, 3083C-T, 3212C-T, 3215T-C, 3259G-A,
3322C-T, 3364G-A, 3385G-T, 3386G-T, 3602T-G, 3610G-A, 4139C-T, 4195G-A,
4222T-C, 4297G-A, 4316G-A, 4319T-C, 4346G-A, 4462T-C, 4469G-A, 4577C-T,
4594G-A, 5041del15, 5281del9, 5459G-C, 5512C-T, 5527C-T, 5657G-A, 5693G-A,
5882G-A, 5908C-T, 5929G-A, 6079C-T, 6088C-T, 6089G-A, 6112C-T, 6148G-C,
6166A-T, 6229C-T, 6286G-A, 6316C-T, 6391G-A, 6415C-T, 6445C-T, and 6543del36.

56. A sequence of claim 55 wherein said sequence difference results in a
frame shift in the amino acid sequence. 

57. The method of claim 55 wherein said sequence difference results in a
splice site in the amino acid sequence. 

58. A sequence encoding SEQ ID NO: 3 having a mutation selected from
the group consisting of C75G, R212C, D249G, Y340D, E471K, D523E, R572Q, R572P,
R602W, F608I, Y639X, G818E, W821R, D846H, G863A, V931M, R943W, N965S,
A1028V, S1071L, V1072A, E1087K, R1108C, E1122K, R1129C, R1129L, L1201R, D1204N,
P1380L, E1399K, W1408R, V1433I, G1439D, F1440S, W1449X, C1488R, C1490Y,
T1526M, D1532N, VVAIC1681del, PAL1761del, R1820P, H1838Y, R1843W, G1886E,
R1898H, G1961E, L1970F, G1977S, L2027F, R2030X, R2030Q, R2038W, V2050L,
K2056X, R2077W, E2096K, R2106C, E2131K, R2139W, R2149X, 1181del12, 0664del13,
2884delC, 4232insTATG, 4947delC, 6709delG, 4253+5G-T, 5196+2T-C, 5585+1G-A,
5714+5G-A, 5898+1G-A, and 6005+1G-T.

-580 -560 -540

CCCCTACCCCTCTGCTAAGCTCAGGGATAACCCAACTAGCTGACCATAATGACTTCAGTC 
-520 -500 -480

ATTACGGAGCAAGATGAAAGACTAAAAGAGGGAGGGATCACTTCAGATCTGCCGAGTGAG 
-460 -440 -420

TCGATTGGACTTAAAGGGCCAGTCAAACCCTGACTGCCGGCTCATGGCAGGCTCTTGCCG 
-400 -380 -360 

AGGACAAATGCCCAGCCTATATTTATGCAAAGAGATTTTGTTCCAAACTTAAGGTCAAAG
-340 -320 -300

ATACCTAAAGACATCCCCCTCAGGAACCCCTCTCATGGAGGAGAGTGCCTGAGGGTCTTG
-280 -260 -240

GTTTCCCATTGCATCCCCCACCTCAATTTCCCTGGTGCCCAGCCACTTGTGTCTTTAGGG 
-220 -200 -180 

TTCTCTTTCTCTCCATAAAAGGGAGCCAACACAGTGTCGGCCTCCTCTCCCCAACTAAGG 
-160 -140 -120

GCTTATGTGTAATTAAAAGGGATTATGCTTTGAAGGGGAAAAGTAGCCTTTAATCACCAG 
-100 -80 -60 

GAGAAGGACACAGCGTCCGGAGCCAGAGGCGCTCTTAACGGCGTTTATGTCCTTTGCTGT 
-40 -20 0

CCTGAGGGGCCTCAGCTCTGACCAATCTGGTCTTCGTGTGGTCATTAGCATGGGCTTCGT 

M G F V 

20 40 60 

GAGACAGATACAGCTTTTGCTCTGGAAGAACTGGACCCTGCGGAAAAGGCAAAAG | ATTCG
RQIQLLLWKNWTLRKRQK IR 
80 100 120 

CTTTGTGGTGGAACTCGTGTGGCCnTTATCTTTATTTCTGGTCTTGATCTGGTTAAGGAA 
FVVEL VWPLSLFLVLIWLRN 
140 160 180 

TGCCAACCCGCTCTACAGCCATCATGAAT | GCCATTTCCCCAACAAGGCGATGCCCTCAGC 
ANPLYSHHEC HFPNKAMPSA 
200 220 240 

AGGAATGCTGCCGTGGCTCCAGGGGATCTTCTGCAATGTGAACAATCCCTGTTTTCAAAG 
GMLPWLQGIFCNVNNPCFQS 
260 280 300

CCCCACCCCAGGAGAATCTCCTGGAATTGTGTCAAACTATAACAACTCCAT | CTTGGCAAG 
PTPGESPGIVSNYNNSI LAR 
320 340 360 

GGTATATCGAGATTTTCAAGAACTCCTCATGAATGCACCAGAGAGCCAGCACCTTGGCCG 
VYRDFQELLMNAPESQHLGR 
380 400 420 

TATTTGGACAGAGCTACACATCTTGTCCCAATTCATQ 

IWTELHILSQFMDTLRTHPE 
440 460 480 

GAGAATTGCAG | GAAGAGGAATACGAATAAGGGATATCTTGAAAGATGAAGAAACACTGAC 
RIAG RGIRIRDILKDEETLT 
500 520 540 

ACTATTTCTCATTAAAAACATCGGCCTGTCTGACTCAGTGGTCTACCTTCTGATCAACT 
LFLIKNIGLSDSVVYLLINS 

FIGURE 3A 

560 580 600

TCAAGTCCGTCCAGAGCAG | TTCGCTCATGGAGTCCCGGACCTGGCGCTGAAGGACATCGC 
QVRPEQ FAHGVPDLALKDI A 

620 640 660 

CTGCAGCGAGGCCCTCCTGGAGCGCTTCATCATCTTCAGCCAGAGACGCGGGGCAAAGAC 
CSEALLERFIIFSQRRGAKT 

680 700 720 

GGTGCGCTATGCCCTGTGCTCCCTCTCCCAGGGCACCCTACAGTGGATAGAAGACACTCT 
VRYAL CSLSQGTLQWIEDTL 
740 760 780 

GTATGCCAACGTGGACTTCTTCAAGCTCTTCCGTGTG | CTTCCCACACTCCTAGACAGCCG 
YANVDFFKLFRV LPTLLDS R 
800 820 840

TTCTCAAGGTATCAATCTGAGATCTTGGGGAGGAATATTATCTGATATGTCACCAAGAAT 
SQGINLRSWGGILSDMSPRI 
860 880 900 

TCAAGAG | TTTATCCATCGGCCGAGTATGCAGGACTTGCTG7GGGTGACCAGGCCCCTCAT 
QE F IHRPSMQDLLWVTRPLM 
920 940 960 

GCAGAATGGTGGTCCAGAGACCTTTACAAAGCTGATGGGCATCCTGTCTGACCTCCTGTG 
QNGGPETFTKLMG ILSDLLC 
980 1000 1020 

TGGCTACCCCGAGGGAGGTGGCTCTCGGGTGCTCTCCTTCAACTGGTATGAAGACAATAA 
GY PEGGGSRVLS FNWYEDNN 
1040 1060 1080 

CTATAAGGCCTTTCTGGGGATTGACTCCACAAGGAAGrGATCCTATCTATTCTTATGACAG 
YRAFL.GIDSTRKD PIYSYDR 
1100 1120 1140

AAGAACAA | CATCCTTTTGTAATGCATTGATCCAGAGCCTGGAGTCAAATCCTTTAACCAA
R T T S F C N - A LIQSLESNPLTK 
1160 1180 1200 

AATCGCTTGGAGGGCGGCAAAGCCTTTGCTGATGGGAAAAATCCTGTACACTCCTGATTC 
IAWRAAKPLLMGKILYTPDS 
1220 1240 1260 

ACCTGCAGCACGAAGGATACTGAAGAAT | GCCAACTCAACTTTTGAAGAACTGGAACACGT 
PAARRILKN ANSTFEELEHV 
1280 1300 1320 

TAGGAAGTTGGTCAAAGCCTGGGAAGAAGTAGGGCCCCAGATCTGGT 
RKLVKA WEEVG PQ I WYFFDN 
1340 1360 1380 

CAGCACACAGATGAACATGATCAGA | GATACCCTGGGGAACCCAACAGTAAAAGACTTTTT
STQMNMIR DTLGNPTVKDFL 
1400 1420 1440 

GAATAGGCAGCTTGGTGAAGAAGGTATTACTGCTGAAGCCATCCTAAACrrTCCT 
NRQLGEEG I TAE A I LNFL Y K 
1460 1480 1500 

GGGCCCTCGGGAAAGCCAGGCTGACGACATGGCCAACTTCGACTGGAGGGAC^ 
G P R E SQADDMANFDWRDI FN 
1520 1540 1560 

CATCACTGATCGCACCCTCCGCCTGGTCAATCAATACCTGGAG | TGCTTGGTCCTGGATAA 
ITDRTLRLVNQYLE C LVLDK 
1580 1600 1620

GTTTGAAAGCTACAATGATGAAACTCAGCTCACCCAACGTGCCCTCTCTCTACTGGAGGA 
FESYNDETQLTQRALSLLEE 

1640 1660 1680 

AAACATGTTCTGGGCCGGAGTGGTATTCCCTGACATGTATCCCTGGACCAGCTCTCTACC 
NMFWAGVVFPDMYPWTSS^P 

1700 1720 1740 

ACCCCACGTGAAGTATAAGATCCGAATGGACATAGACGTGGTGGAGAAAACCAATAAGAT 
PHVKYKIRMDIDVVEKTNKI 
1760 1780 1800

TAAAGACAG | GTATTGGGATTCTGGTCCCAGAGCTGATCCCGTGGAAGATTTCCGGTACAT 
KDR YWDSGPRADPVEDFRYI 
1820 1840 1860

CTGGGGCGGGTTTGCCTATCTGCAGGACATGGTTGAACAGGGGATCACAAGGAGCCAGGT 
WGGFAYLQDMVEQGITRSQV 

1880 1900 1920 

GCAGGCGGAGGCTCCAGTTGGAATCTACCTCCAGCAGATGCCCTACCCCTGCTTCGTGGA 
QAEAPVGIYLQQMPYPCFVD 
1940 1960 1980

CGATTC | TTTCATGATCATCCTGAACCGCTGTTTCCCTATCTTCATGGTGCTGGCATGGAT
DS FMIILNRCFPIFMVLAWI 
2000 2020 2040 

CTACTCTGTCTCCATGACTGTGAAGAGCATCGTCTTGGAGAAGGAGTTGCGACTGAAGGA 
YSVSM.TVKS IVLEKELRLKE 
2060 2080 2100 

GACCTTGAAAAATCAGGGTGTCTCCAATGCAGTGATTTGGTGTACCTGGTTCCTGGACAG 
TLKNQGVSNAVIWCTWFLDS 
2120 2140 2160 

CTTCTCCATCATGTCGATGAGCATCTTCCTCCTGACGATATTCATCATG | CATGGAAGAAT 
FSIMSMSIFLLTIFIM H GRI 
2180 2200 2220 

CCTACATTACAGCGACCCATTCATCCTCTTCCTGTTCTTGTTGGCTITCTCCACTGCCAC 
LHYSDPFILFLFLLAFSTAT 

2240 2260 2280 

CATCATGCTGTGCTITCTGCTCAGCACCTTCTTCTCCAAGGCCAGTCTGGCAGCAGCCT 
IMLCFLLSTFFSKASLAAAC 
2300 2320 2340 

TAGTGGTGTCATCTATTTCACCCTCTACCTGCCACACATCCrrGTGCTTCGCCTGGCAGGA 
SGVIYFTLYLPHILCFAWQD 
2360 2380 2400 

CCGCATGACCGCTGAGCTGAAGAAGGCTGTG | AGCTTACTGTCTCCGGTGGCATTTGGATT 
RMTAELKKAV S LLSPVAFGF 
2420 2440 2460 

TGGCACTGAGTACCTGGTTCGCTTTGAAGAGCAAGGCCTGGGGCTC 
G TEYLVRFEEQ GLGLQWSNI 
2480 2500 2520 

CGGGAACAGTCCCACGGAAGGGGACGAATTCAGCTTCCTGCTGTCCATGCAGATGATGCT 
GNSPTEGDEFSFLLSMQMMI* 
2540 2560 2580 

CCTTGATGCTGCGTGCTATGGCTTACTCGCTTGGTACCTTGATCAGGTGTTTCCAG | GAGA 
LDA ACYGLLAWYLDQVFPG D 
2600 2620 2640

CTATGGAACCCCACTTCCTTGGTACTTTCTTCTACAAGAGTCGTATTGGCTTGGCGGTGA 
YGTPLPWYFLLQESYWLGGE 
2660 2680 2700 

AG | GGTGTTCAACCAGAGAAGAAAGAGCCCTGGAAAAGACCGAGCCCCTAACAGAGGAAAC 
G CS TREERALEKTEPLTEET 
2720 2740 2760

GGAGGATCCAGAGCACCCAGAAGGAATACACG | ACTCCTTCTTTGAACGTGAGCATCCAGG 
EDPEHPEGIHD SFFERE HPG 
2780 2800 2820

GTGGGTTCCTGGGGTATGCGTGAAGAATCTGGTAAAGATTTTTGAGCCCTGTGGCCGGCC 
WVPGVCVKNLVKI FE PCGRP 
2840 2860 2880 

AGCTGTGGACCGTCTGAACATCACCTTCTACGAGAACCAGATCACCGCATTCCTGGGCCA 
AVDRLNITFYENQ ITAFLGH 
2900 2920 2940 

CAATGGAGCTGGGAAAACCACCACCTT | GTCCATCCTGACGGGTCTGTTGCCACCAACCTC 
NGAGKTTTL SILTGLLPPTS 
2960 2980 3000

TGGGACTGTGCTCGTTGGGGGAAGGGACATTGAAACCAGCCTGGATGCAGTCCGGCAGAG 
GTVLVGGRDIETS LDAVRQS 
3020 3040 3060 

CCTTGGCATGTGTCCACAGCACAACATCCTGTTCCACCA | CCTCACGGTGGCTGAGCACAT 
LGMCPQHNILFHH LTVAEHM 
3080 3100 3120 

GCTGTTCTATGCCCAGCTGAAAGGAAAGTCCCAGGAGGAGGCCCAGCTGGAGATGGAAGC 
LFYAQLKGKSQEEAQLEMEA 
3140 3160 3180 

CATGTTGGAGGACACAGGCCTCCACCACAAGCGGAATGAAGAGGCTCAGGACCTATCAG | G 
MLEDTGLHHKRNEEAQDLSG 
3200 3220 3240 

TGGCATGCAGAGAAAGCTGTCGGTTGCCATTGCCTTTGTGGGAGATGCCAAGGTGGTGAT 
GMQRKLSVAIAFVGDAKVV I 
3260 3280 3300 

TCTGGACGAACCCACCTCTGGGGTGGACCCTTACTCGAGACGCTCAATCTGGGATCTGCT 
LDEPTSGVDPYSRRS IWDLL 
3320 3340 3360 

CCTGAAGTATCGCTCAG | GCAGAACCATCATCATGTCCACTCACCACATGGACGAGGCCGA 
LKYRSG RTIIMSTHHMDEAD 
3380 3400 3420 

CCTCCTTGGGGACCGCATTGCCATCATTGCCCAGGGAAGGCTCTACTGCTCAGGCACCCC 
LLGD RIAIIAQGRLYCSGTP 
3440 3460 3480 

ACTCTTCCTGAAGAACTGCTTTGGCACAGGCTTGTACTTAACCTTGGTG 
LP. LKNCFGTGLYLTLVRKMK 
3500 3520 3540 

AAACATCCAGAGCCAAAGGAAAGGCAGTGAG | GGGACCTGCAGCTGCTCGTCTAAGGGTTT 
NIQSQRKGSE GTCSCSSKGF 
3560 3580 3600 

CTCCACCACGTGTCCAGCCCACGTCGATGACCTAACTCCAGAACAAGTCCTGGATG | GGGA 
STTCPAHVDDLTP EQVLDG D 
3620 3640 3660

TGTAAATGAGCTGATGGATGTAGTTCTCCACCATGTTCCAGAGGCAAAGCTGGTGGAGTG 
VNELMDVVLHHVPEAKLVEC 

3680 3700 3720 

CATTGGTCAAGAACTTATCTTCCTTCTTCCAAATAAGAACTTCAAGCACAGAGCATATGC 
IGQ ELIFLLPNKNFKHRAVA 

3740 3760 3780 

CAGCCTTTTCAGAGAGCTGGAGGAGACGCTGGCTGACCTTGGTCTCAGCAGTTTTGGAAT 
SLFRELEETLADLGLSSFG I 
3800 3820 3840 

TTCTGACACTCCCCTGGAAGAG | ATTTTTCTGAAGGTCACGGAGGATTCTGATTCAGGACC 
SDTPLEE IFLKVTEDSDSGP 
3860 3880 3900 

TCTGTTTGCGG | GTGGCGCTCAGCAGAAAAGAGAAAACGTCAACCCCCGACACCCCTGCTT 
LFAG GAQQKRENVKPRHPCL 
3920 3940 3960 

GGGTCCCAGAGAGAAGGCTGGACAGACACCCCAGGACTCCAATGTCTGCTCCCCAGGGGC 
GPREKAGQTPQDSNVC S PGA 
3980 4000 4020 

GCCGGCTGCTCACCCAGAGGGCCAGCCTCCCCCAGAGCCAGAGTGCCCAGGCCCGCAGCT 
PAAHPEGQPPPEPECPGPQL 
4040 4060 4080 

CAACACGGGGACACAGCTGGTCCTCCAGCATGTGCAGGCGCTGCTGGTCAAGAGATTCCA 
NTGTQ LVLQHVQALLVKRFQ 
4100 4120 4140 

ACACACCATCCGCAGCCACAAGGACTTCCTGGCGCAG | ATCGTGCTCCCGGCTACCTTTGT 
HTIRSHKDFLAQ IVLPATFV 
4160 4180 4200 

GTTTTTGGCTCTGATGCTTTCTATTGTTATCCTTCCTTTTGGCGAATACCCCGCTTTGAC 
FLALMLS IVILPFGEYPALT 
4220 4240 4260 

CCTTCACCCCTGGATATATGGGCAGCAGTACACCTTCTTCAG | CATGGATGAACCAGGCAG 
LHPWIYGQQYTFFS MDEPGS 
4280 4300 4320 

TGAGCAGTTCACGGTACTTGCAGACGTCCTCCTGAATA 
EQFTVLADVLLNKPGFGNRC 
4340 4360 4380 

CCTGAAGGAAGGGTGGCTTCC | GGAGTACCCCTGTGGCAACTCAACACCCTGGAAGACTCC 
LKEGWLP EYPCGNSTPWKTP 
4400 4420 4440 

TTCTGTGTCCCCAAACATCACCCAGCTGTTCCAGAAGCAGAAATGGACACAGGTCAACCC 
SVSPNITQLFQKQKWTQVNP 
4460 4480 4500 

TTCACCATCCTGCAG | GTGCAGCACCAGGGAGAAGCTCACCATGCTGCCAGAGTGCCCCGA 
SPSCR CSTREK LTMLPECPE 
4520 4540 4560 

GGGTGCCGGGGGCCTCCCGCCCCCCCAG | AGAACACAGCGCAGCACGGAAATTCTACAAGA 
GAGGLPPPQ RTQRSTEILQD 

FIGURE 3B 

4580 4600 4620

CCTGACGGACAGGAACATCTCCGACTTCTTGGTAAAAACGTATCCTGCTCTTATAAGAAG 
LTDRNI SDFLVKTYPAL I RS 

4640 4660 4680 

CAG | CTTAAAGAGCAAATTCTGGGTCAATGAACAGAG | GTATGGAGGAATTTCCATTGGAGG
S LKSKFWVNEQR-YGGISIGG 
4700 4720 4740 

AAAGCTCCCAGTCGTCCCCATCACGGGGGAAGCACTTGTTGGGlT'ri'TAAGCGACCTTGG 
KLPVVPITGEALVGFLSDLG 
4760 4780 4800 

CCGGATCATGAATGTGAGCGGG | GGCCCTATCACTAGAGAGGCCTCTAAAGAAATACCTGA
RIMNVSG GPITREASKEIPD 
4820 4840 4860 

TTTCCTTAAACATCTAGAAACTGAAGACAACATTA | AGGTGTGGTTTAATAACAAAGGCTG 
FLKHLETEDNIK VWFNNKGW 
4880 4900 4920

GCATGCCCTGGTCAGCTTTCTCAATGTGGCCCACAACGCCATCTTACGGGCCAGCCTGCC 
HAL VS FLNVAHNA I LRASLP 
4940 4960 4980 

TAAGGACAGGAGCCCCGAGGAGTATGGAATCACCGTCATTAGCCAACCCCTGAACCTGAC 
KDRSPEEYGITVI SQPLNLT 
5000 5020 5040 

CAAGGAGCAGCTCTCAGAGATTACAGT | GCTGACCACTTCAGTGGATGCTGTGGTTGCCAT 
KEQLSEITV LTTS VDAVVAI 
5060 5080 5100 

CTGCGTGATTTTCTCCATGTCCTTCGTCCCAGCCAGCTTTGTCC^ 
CVIFSMSFVPASFVLYLIQE 
5120 5140 5160 

GCGGGTGAACAAATCCAAGCACCTCCAGTTTATCAGTGGAGTGAGCCCCACCACCTACTG 
RVNKSKHLQFISGVSPTTYW 
5180 5200 5220 

GGTGACCAACTTCCTCTGGGACATC | ATGAATTATTCCGTGAGTGCTGGGCTGGTGGTGGG 
VTNFLWDI MNYSVSAGLVVG 
5240 5260 5280 

CATCTTCATCGGG^TrCAGAAGAAAGCCTACACTTCTCCAGAAAACCTTCCTGCCCTTGT 
IFIGFQKKAY TS PENLPALV 
5300 5320 5340 

GGCACTGCTCCTGCTGTATGG | ATGGGCGGTCATTCCCATGATGTACCCAGCATCCTTCCT
ALLLLYG WAVIPMMYPASFL 
5360 5380 5400 

GTTTGATGTCCCCAGCACAGCCTATGTGGCTTTATCTTGTGCTAATCTGTTCATCGGCAT 
FDVPSTA YV ALS CANLF I G I 
5420 5440 5460 

CAACAGCAGTGCTATTACCTTC^TCTTGGAATTATTTGATAATAACCGG | ACGCTGCTCAG 
NSSA ITFILELFDNNR TLLR 
5480 5500 5520 

GTTCAACGCCGTGCTGAGGAAGCTGCTCATTGTCTTCCCCCACTTCTGCCTGGGCCGGGG 
FNA VLRKLLIVFPH FCLGRG 
5540 5560 5580

CCTCATTGACCTTGCACTGAGCCAGGCTGTGACAGATGTCTATGCCCGGTTTG | GTGAGGA
LIDLALSQAVTDVYARFG EE 
5600 5620 5640 

GCACTCTGCAAATCCGTTCCACTGGGACCTGATTGGGAAGAACCTGTTTGCCATGGTGGT 
HSANPFHWDLIGKNLFAMVV 
5660 5680 5700

GGAAGGGGTGGTGTACTTCCTCCTGACCCTGCTGGTCCAGCGCCACTTCTTCCTCTCCCA 
EGVVYFLLTLLVQRHFFLSQ 
5720 5740 5760

ATG | GATTGCCGAGCCCACTAAGGAGCCCATTGTTGATGAAGATGATGATGTGGCTGAAGA
W IAEPTKEPIVDEDDDVAEE 
5780 5800 5820

AAGACAAAGAATTATTACTGGTGGAAATAAAACTGACATCTTAAGGCTACATGAACTAAC 
RQRIITGGNKTDILRLHELT 
5840 5860 5880 

CAAG | ATTTATCTGGGCACCTCCAGCCCAGCAGTGGACAGGCTGTGTGTCGGAGTTCGCCC
K IYLGTSSPAVDRLCVGVRP 

5900 5920 5940

TGGAGAG | TGCTTTGGCCTCCTGGGAGTGAATGGTGCCGGCAAAACAACCACATTCAAGAT
GE CFGLLGVNGAGKTTTFKM 
5960 5980 6000

GCTCACTGGGGACACCACAGTGACCTCAGGGGATGCCACCGTAGCAGGCAAGAG | TATTTT
LTG DTTVTSGDATVAGKS IL 
6020 6040 6060 

AACCAATATTTCTGAAGTCCATCAAAATATGGGCTACTGTCCTCAGTTTGATGCAATCGA 
TNISEVHQNMGYCPQFDAID 
6080 6100 6120 

TGAGCTGCTCACAGGACGAGAACATCTTTACCTTTATGCCCGGCTTCGAGGTGTACCAGC 
ELLTGREHLYLYARLRGVPA 
6140 6160 6180

AGAAGAAATCGAAAAG | GTTGCAAACTGGAGTATTAAGAGCCTGGGCCTGACTGTCTACGC 
EEIEK VANWSIKSLGLTVYA 
6200 6220 6240 

CGACTGCCTGGCTGGCACGTACAGTGGGGGCAACAAGCGGAAACTCTCCACAGCCATCGC

DCLAGTYSGGNKRKLSTAIA 
6260 6280 6300 

ACTCATTGGCTGCCCACCGCTGGTGCTGCTG | GATGAGCCCACCACAGGGATGGACCCCCA 

LIGCPPLVLL DEPTTGMDPQ 

6320 6340 6360

GGCACGCCGCATGCTGTGGAACGTCATCGTGAGCATCATCAGAGAAGGGAGGGCTGTGGT 

ARRMLWNVIVSIIREGRAVV 
6380 6400 6420

CCTCACATCCCACAG | CATGGAAGAATGTGAGGCACTGTGTACCCGGCTGGCCATCATGGT

L T .L« H S MEECE ALCTRLAIMV 
6440 6460 6480

AAAGGGCGCCTTTCGATGTATGGGCACCATTCAGCATCTCAAGTCCAA | ATTTGGAGATGG 
KGAFRCMGT1QHLKS K FGDG 
6500 6520 6540 

CTATATCGTCACAATGAAGATCAAATCCCCGAAGGACGACCTGCTTCCTGACCTGAACCC 
Y IVT MK IKSPKDDLLPDLNP 
6560 6580 6600

TGTGGAGCAGTTCTTCCAGGGGAACTTCCCAGGCAGTGTGCAGAGGGAGAGGCACTACAA 
VEQFFQGNFPGSVQRERHYN 

6620 6640 6660 

CATGCTCCAGTTCCAGGTCTCCTCCTCCTCCCTGGCGAGGATCTTCCAGCTCCTCCTrrC 
MLQFQVSSSSLARI FQLLLS 

6680 6700 6720 

CCACAAGGACAGCCTGCTCATCGAGGAGTACTCAGTCACACAGACCACACTGGACCAG | GT
HKDSLLIEEYSVTQTTLDQ V 

6740 6760 6780 

GTTTGTAAATTTTGCTAAACAGCAGACTGAAAGTCATGACCTCCCTCTGCACCCTCGAGC 
FVNFAKQQTESHDLPLH PRA 

6800 6820 6840 

TGCTGGAGCCAGTCGAGAAGCCCAG | GACTGATCTTTCACACCGCTCGTTCCTGCAGCCAG 
AGASRQAQ D * 

6860 6880 6900 

AAAGGAACTCTGGGCAGCTGGAGGCGCAGGAGCCTGTGCCCATATGGTCATCCAAATGGA

6920 6940 6960 

CTGGCCCAGCGTAAATGACCCCACTGCAGCAGAAAACAAACACACGAGGAGCATGCAGCG 

6980 7000 7020 

AATTCAGAAAGAGGTCTTTCAGAAGGAAACCGAAACTGACTTGCTCACCTGGAACACCTG 

7040 7060 7080 

ATGGTGAAACCAAACAAATACAAAATCCTTCTCCAGACCCCAGAACTAGAAACCCCGGGC

7100 7120 7140 

CATCCCACTAGCAGCTTTGGCCTCCATATTGCTCTCATTTCAAGCAGATCT 

7160 7180 
CATGTTTGTCTGTGTGTCTGCGTTGTGTGTGATTTTCATGGAAA 

FIGURE 3H 